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Box Patent Application 

Assistant Commissioner for Patents 

Washington D.C 20231 

PRELIMINARY AMENDMENT 

Dear Sir: 

Applicants respectfully request that a Preliminary Amendment for the above-identified 
Application be entered as contained herein. 

In the claims: 

In this patent application, please consider under prosecution claims 1-18 wherein 
amended claims 1, 3, 6, 8-11, and 13-18 are entered as follows: 

1 . (Amended) Process for oxidising a substrate which is an acyclic or cyclic terpene, or a 
cycloalkene or a substituted derivative thereof, and which process comprises: 
oxidising said substrate with a mutant haem-containing enzyme, the mutant 
comprising a substitution of an amino acid in an active site by an amino acid with a 
less polar side-chain. 
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3. (Amended) Process according to claim 2 in which the enzyme is one in which amino acid 
47 or 51 or 47 and 51 of P450 BM _ 3 , or amino acid 96 of P450 cam , or the equivalent 
amino acid in a said homologue, have been changed to an amino acid with a less 
polar side-chain. 

6. (Amended) The enzyme as defined in claim 4 excluding mutants of P450 cam which only 
have the mutations F87A-Y96G-F193A, F87A-Y96G-F193A-C334A, or T101M- 
T185F-V247M. 

8. (Amended) A cell which expresses: 

(i) a mutant haem-containing enzyme comprising a substitution of an amino acid in an active 
site by an amino acid with a less polar side-chain which in its naturally occurring form 
has an electron transfer reductase domain, or 

(ii) (a) a mutant haem-containing enzyme comprising a substitution of an amino acid in 

an active site by an amino acid with a less polar side-chain, 

(b) an electron transfer reductase, and 

(c) an electron transfer redoxin; or 

(iii) (a) (1) P450 cam5 or a fragment thereof; or 

(2) a naturally occurring homologue of P450 cam or a fragment thereof; or 

(3) a mutant P450 cam , or a mutant of a naturally occurring homologue of 
P450 cam comprising a substitution of an amino acid in an active site by an amino 
acid with a less polar side-chain; or 

(4) a P450 cam which has at least 70% amino acid homology with (1 ), (2) or (3) 
and optionally in which amino acid 96 has been changed to an amino acid with 
a less polar side-chain; and 

(b) an electron transfer reductase; and 
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(c) 



an electron transfer redoxin, or 



(iv) (a) (1) P450 BM _ 3 , or a fragment thereof; or 

(2) a naturally occurring homologue of P450 BM _ 3 or a fragment thereof; or 

(3) a mutant P450 BM _ 3? or a mutant of a naturally occurring homologue of 
P450 BM _ 3 comprising a substitution of an amino acid in an active site by an amino 
acid with a less polar side-chain 

excluding an E. Coli DH5a cell in which the only mutants of P450 cam which are 
expressed are amongst the following: 

H 2 N-P450 cam -TDGTSST~putidaredoxin reductase-TDGASSS-putidaredoxin-COOH, 

H 2 N-P450 cam -TDGTRPGPGPGPGPSST-putidaredoxin reductase-TDGASSS-putidaredoxin- 
COOH, 

H 2 N-P450 cara -TDGTRPGPGPGPGPGPGPSST-putidaredoxin reductase-TDGASSS 
putidaredoxin-COOH ? 

H 2 N-450 cam -putidaredoxin reductase-TDGASSS-putidaredoxin-PLEL-P450 cam -COOH. 

9. (Amended) The cell according to claim 8 in which (a) ; (b) and (c) or (b) and (c) are 

expressed together in the same fusion protein. 

10. (Amended) The cell according to claim 8 in which: 

(b) is putidaredoxin reductase or a fragment thereof; and/or 

(c) is putidaredoxin or a fragment thereof. 
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1 1 . (Amended ) Process for oxidising a substrate which is an acyclic or cyclic terpene, or a 
cycloalkene, or a substituted derivative thereof, and which process comprises 
oxidising said substrate with a mutant haem-containing enzyme, the mutant 
comprising a substitution of a first amino acid in an active site by an amino acid with 
a less polar side-chain, wherein the substrate is oxidised in a cell according to claim 8. 

13. (Amended) Process for selecting a mutant of P450 cam or P450 BM _ 3 , or a homologue 

thereof, for its ability to oxidise a substrate, which process comprises screening a 
group of said mutants for their oxidation effect on the substrate. 

14. (Amended) Process according to claim 13 in which the mutant is additionally selected 

for its ability to oxidise the substrate to an oxidation product. 

15. (Amended) Process for selecting a mutant of P450 cam or P450 BM _ 3 , or a homologue 

thereof, for its ability to oxidise a substrate, which process comprises screening a 
group of said mutants for their oxidation effect on the substrate in which the screening 
is carried out on the library made in a process according to claim 12. 

16. (Amended) A process for producing a library of oxidation products comprising 

providing an acyclic or cyclic terpene, or a cycloalkene, or a substituted derivative 
thereof to a library made in a process according to claim 12 and allowing oxidation of 
the substrate. 

17. (Amended) A method of treatment of a human or an animal comprising administering an 

oxidation product obtained by oxidising a substrate which is an acyclic or cyclic 
terpene, or a cycloalkene, or a substituted derivative thereof, and which process 
comprises oxidising said substrate with a mutant haem-containing enzyme, the mutant 
comprising a substitution of an amino acid in an active site by an amino acid with a 
less polar side-chain, wherein optionally the enzyme is one which has been selected in 
a process according to claim 13. 
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18. (Amended) A pharmaceutical composition comprising an oxidation product obtained 
by oxidising a substrate which is an acyclic or a cyclic terpene, or a cycloalkene, or a 
substituted derivative thereof, and which process comprises oxidising said substrate 
with a mutant haem-containing enzyme, the mutant comprising a substitution of an 
amino acid in an active site by an amino acid with a less polar side-chain, wherein 
optionally the enzyme is one which has been selected in a process according to claim 
13 and a pharmaceutically acceptable carrier or diluent. 
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REMARKS 



Applicants have attached a marked version of the claims in this application. For the 
Examiner's convenience, Applicants have also attached a clean copy of all pending claims in 
this application after amendments. 



Fulbright & Jaworski L.L.P. 
1301 McKinney, Suite 5100 
Houston, Texas 77010-3095 
Telephone: 713-651-5325 
Telecopier: 713-651-5246 



Respectfully submitted, 





Thomas D. Paul 
Registration No. 32,714 
Counsel for Applicants 
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MARKED VERSION OF CLAIMS SHOWING AMENDMENTS 



1. (Amended) Process for oxidising a substrate which is an acyclic or cyclic terpene [;] A or 
a cycloalkene or a substituted derivative thereof, and which process comprisesi 
oxidising said [compound] substrate with a mutant haem-containing enzyme, the 
mutant comprising [the] a substitution of an amino acid in [the] an active site by an 
amino acid with a less polar side-chain. 

3. (Amended) Process according to claim 2 in which the enzyme is one in which amino 
acid 47 [and/] or 51 or 47 and 51 of P450 BM _ 3 , or amino acid 96 of P450 cam5 or the 
equivalent amino acid[(s)] in a said homologue, have been changed to an amino acid 
with a less polar side-chain 

6. (Amended) [An enzyme] The enzyme as defined in claim 4 excluding mutants of P450 C3m 
which only have the mutations F87A-Y96G-F193A, F87A-Y96G-F193A- C334A, or 
T 1 0 1 M-T 1 8 5F-V247M. 

8. (Amended) A cell which expresses: 

(i) a mutant haem-containing enzyme comprising [the] a substitution of an amino acid in 
[the] an active site by an amino acid with a less polar side-chain which in its naturally 
occurring form has an electron transfer reductase domain, or 

(ii) (a) a mutant haem-containing enzyme comprising [the] a substitution of an amino 

acid in [the] an active site by an amino acid with a less polar side-chain, 

(b) an electron transfer reductase, and 
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(c) an electron transfer redoxin; or 

(iii) (a) (1) P450 cam? or a fragment thereof; or 

(2) a naturally occurring homologue of P450 cam or a fragment thereof; or 

(3) a mutant P450 cam , or a mutant of a naturally occurring homologue of 
P450 cam comprising [the] a substitution of an amino acid in [the] an active 
site by an amino acid with a less polar side-chain; or 

(4) a P450 cam which has at least 70% amino acid homology with (1), (2) or (3) 
and optionally in which amino acid 96 has been changed to an amino acid 
having a less polar side-chain; and 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin, or 

(iv) (a) (1) P450 BM _ 3 , or a fragment thereof; or 

(2) a naturally occurring homologue of P450 BM „ 3 or a fragment thereof; or 

(3) a mutant P450 BM _ 3? or a mutant of a naturally occurring homologue of 
P450 BM _ 3 comprising [the] a substitution of an amino acid in [the] an active 
site by an amino acid with a less polar side-chain 

excluding an E. Coli DH5a cell in which the only mutants of P450 cam which are expressed are 
amongst the following: 

H 2 N-P450 cam -TDGTSST-putidaredoxin reductase-TDGASSS-putidaredoxin-COOH, 

H 2 N-P450 cam -TDGTRPGPGPGPGPSST-putidaredoxin reductase-TDGASSS-putidaredoxin- 
COOH, 
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H 2 N-P450 cam -TDGTRPGPGPGPGPGPGPSST-putidaredoxin reductase-TDGASSS 
putidaredoxin-COOH, 

H 2 N-450 cam -putidaredoxin reductase-TDGASSS-putidaredoxin-PLEL-P450 cam -COOH. 

9. (Amended) [A] The cell according to claim 8 in which (a), (b) and (c) or (b) and (c) are 

expressed together in the same fusion protein. 

10. (Amended) [A] The cell according to claim 8 in which: 

(b) is putidaredoxin reductase or a fragment thereof; and/or 

(c) is putidaredoxin or a fragment thereof. 

1 1 . (Amended ) Process for oxidising a substrate which is an acyclic or cyclic terpene, or a 

cycloalkene, or a substituted derivative thereof!;] 4 and which process comprises 
oxidising said [compound] substrate with a mutant haem-containing enzyme, the 
mutant comprising [the] a substitution of a first amino acid in [the] an active site by 
an amino acid with a less polar side-chain, wherein the [compound] substrate is 
oxidised in a cell according to claim 8. 

13. (Amended) Process for selecting a mutant of P450 cam or P450 BM _ 3 , or a homologue 

thereof, for its ability to oxidise a [particular] substrate, which process 
comprises screening a group of said mutants for their oxidation effect on the 
[particular] substrate. 

14. (Amended) Process according to claim 13 in which the mutant is additionally selected 

for its ability to oxidise the [particular compound] substrate to [a] an 
[particular] oxidation product. 
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15. (Amended) Process [according to claim 13] for selecting a mutant of P450^ m or P450 PM _ ? , 

or a homologue thereof, for its ability to oxidise a substrate, which process comprises 
screening a group of said mutants for their oxidation effect on the substrate in which 
the screening is carried out in a library made in a process according to claim 12. 

16. (Amended) A process for producing a library of oxidation products comprising 

providing an acyclic or cyclic terpene, or a cycloalkenef^ or a substituted derivative 
thereof to a library made in a process according to claim 12 and allowing oxidation of 
the substrate. 

17. (Amended) A method of treatment of a human or an animal comprising administering an 

oxidation product obtained by oxidising a substrate which is an acyclic or cyclic 
terpene, or a cycloalkenef;]^ or a substituted derivative thereof, and which 
process comprises oxidising said substrate with a mutant haem-containing enzyme, 
the mutant comprising [the] a substitution of an amino acid in [the] an active site by 
an amino acid with a less polar side-chain, wherein optionally the enzyme is one 
which has been selected in a process according to claim 13. 

18. (Amended) A pharmaceutical composition comprising an oxidation product obtained by 

oxidising a substrate which is an acyclic or a cyclic terpene, or a 

cycloalkenef;^ or a substituted derivative thereof, and which process comprises 

oxidising said [compound] substrate with a mutant haem-containing 

enzyme, the mutant comprising [the] a substitution of an amino acid in [the] an active 

site by an amino acid with a less polar side-chain, wherein optionally the enzyme is 

one which has been selected in a process according to claim 13 and a 

pharmaceutically acceptable carrier or diluent. 
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1 . (Amended) Process for oxidising a substrate which is an acyclic or cyclic terpene, or a 

cycloalkene or a substituted derivative thereof, and which process comprises: 
oxidising said substrate with a mutant haem-containing enzyme, the mutant 
comprising a substitution of an amino acid in an active site by an amino acid with a 
less polar side-chain. 

2. Process according to claim 1 in which the enzyme is a mutant of P450 cam or P450 BM _ 3 , or a 

mutant of a naturally occurring homologue of either of these enzymes. 

3. (Amended) Process according to claim 2 in which the enzyme is one in which amino acid 

47 or 51 or 47 and 51 of P450 BM _ 3 , or amino acid 96 of P450 cam , or the equivalent 
amino acid in a said homologue, have been changed to an amino acid with a less 
polar side-chain. 

4. Process according to claim 1 in which there are one or more other amino acid substitutions 

in the active site. 

5. Process according to claim 1 in which the enzyme is (i) P450 cam and comprises one or 

more of the following mutations: F87W, F87I, F87L, T185L, T185F, V247A, V247L 
or F87A-I395F; or (ii) P450 BM _ 3 and comprises the mutation R47L-Y51F. 

6. (Amended) The enzyme as defined in claim 4 excluding mutants of P450 cam which only 

have the mutations F87A-Y96G-F193A, F87A-Y96G-F193A-C334A, or T101M- 
T185F-V247M. 

7. A polynucleotide which comprises a sequence which encodes an enzyme as defined in 

claim 6. 
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8. (Amended) A cell which expresses: 

(i) a mutant haem-containing enzyme comprising a substitution of an amino acid in an active 
site by an amino acid with a less polar side-chain which in its naturally occurring form 
has an electron transfer reductase domain, or 

(ii) (a) a mutant haem-containing enzyme comprising a substitution of an amino acid in 

an active site by an amino acid with a less polar side-chain, 

(b) an electron transfer reductase, and 

(c) an electron transfer redoxin; or 

(iii) (a) (1) P450 cam) or a fragment thereof; or 

(2) a naturally occurring homologue of P450 cam or a fragment thereof; or 

(3) a mutant P450 cam , or a mutant of a naturally occurring homologue of 
P450 cam comprising a substitution of an amino acid in an active site by an amino 

acid with a less polar side-chain; or 

(4) a P450 cam which has at least 70% amino acid homology with (1), (2) or (3) 
and optionally in which amino acid 96 has been changed to an amino acid with 
a less polar side-chain; and 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin, or 

(iv) (a) (1) P450 BM _ 3 , or a fragment thereof; or 

(2) a naturally occurring homologue of P450 BM „ 3 or a fragment thereof; or 
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(3) a mutant P450 BM _ 3 , or a mutant of a naturally occurring homologue of 
P450 BM _ 3 comprising a substitution of an amino acid in an active site by an amino 
acid with a less polar side-chain 

excluding an E. Coli DH5a cell in which the only mutants of P450 cam which are 
expressed are amongst the following: 

H 2 N-P450 cam -TDGTSST-putidaredoxin reductase-TDGASSS-putidaredoxin-COOH, 

H 2 N-P450 cam -TDGTRPGPGPGPGPSST-putidaredoxin reductase-TDGASSS-putidaredoxin- 
COOH, 

H 2 N-P450 cam -TDGTRPGPGPGPGPGPGPSST-putidaredoxin reductase-TDGASSS 
putidaredoxin-COOH, 

H 2 N-450 cam -putidaredoxin reductase-TDGASSS-putidaredoxin-PLEL-P450 cam -COOH. 

9. (Amended) The cell according to claim 8 in which (a), (b) and (c) or (b) and (c) are 

expressed together in the same fusion protein. 

10. (Amended) The cell according to claim 8 in which: 

(b) is putidaredoxin reductase or a fragment thereof; and/or 

(c) is putidaredoxin or a fragment thereof. 

1 1 . (Amended ) Process for oxidising a substrate which is an acyclic or cyclic terpene, or a 

cycloalkene, or a substituted derivative thereof, and which process comprises 
oxidising said substrate with a mutant haem-containing enzyme, the mutant 
comprising a substitution of a first amino acid in an active site by an amino acid with 
a less polar side-chain, wherein the substrate is oxidised in a cell according to claim 8. 
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12. A process for making a library of mutants of P450 cam or P450 BM _ 3> or mutants of a 

homologue of either of these enzymes comprising contacting a cell according to claim 
8 , or an E. Coli DH5a cell in which the only mutants of P450 cam which are expressed 
are amongst the following: 

H 2 N-P450 cam -TDGTSST-putidaredoxin reductase-TDGASSS-putidaredoxin-COOH, 

H 2 N-P450 cam -TDGTRPGPGPGPGPSST-putidaredoxin reductase-TDGASSS-putidaredoxin- 
COOH, 

H 2 N-P450 cam -TDGTRPGPGPGPGPGPGPSST-putidaredoxin reductase-TDGASSS- 
putidaredoxin-COOH, 

H 2 N-putidaredoxin reductase-TDGASSS-putidaredoxin-PLEL-P450 cam -COOH; 

with a mutagen and/or when the cell is a mutator cell culturing the cell in conditions in 
which mutants are produced. 

13. (Amended) Process for selecting a mutant of P450 cam or P450 BM _ 3 , or a homologue 

thereof, for its ability to oxidise a substrate, which process comprises screening a 
group of said mutants for their oxidation effect on the substrate. 

14. (Amended) Process according to claim 13 in which the mutant is additionally selected 

for its ability to oxidise the substrate to an oxidation product. 

1 5 . (Amended) Process for selecting a mutant of P450 cam or P450 BM _ 3? or a homologue 

thereof, for its ability to oxidise a substrate, which process comprises screening a 
group of said mutants for their oxidation effect on the substrate in which the 
screening is carried out on the library made in a process according to claim 12. 
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16. (Amended) A process for producing a library of oxidation products comprising 

providing an acyclic or cyclic terpene, or a cycloalkene, or a substituted derivative 
thereof to a library made in a process according to claim 12 and allowing oxidation of 
the substrate. 

17. (Amended) A method of treatment of a human or an animal comprising administering an 

oxidation product obtained by oxidising a substrate which is an acyclic or cyclic 
terpene, or a cycloalkene, or a substituted derivative thereof, and which process 
comprises oxidising said substrate with a mutant haem-containing enzyme, the mutant 
comprising a substitution of an amino acid in an active site by an amino acid with a 
less polar side-chain, wherein optionally the enzyme is one which has been selected in 
a process according to claim 13. 



1 8. (Amended) A pharmaceutical composition comprising an oxidation product obtained 

by oxidising a substrate which is an acyclic or a cyclic terpene, or a cycloalkene, or a 
substituted derivative thereof, and which process comprises oxidising said substrate 
with a mutant haem-containing enzyme, the mutant comprising a substitution of an 
amino acid in an active site by an amino acid with a less polar side-chain, wherein 
optionally the enzyme is one which has been selected in a process according to claim 
13 and a pharmaceutically acceptable carrier or diluent. 
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, PR^E£S^FOR OXIDISING TERgENES 

The invention relates to a process for 
enzymatically oxidising terpenes and cycloalkenes. 

Terpenoid compounds are widespread in biological 
5 systems and constitute one of the largest class of 
natural products. They are major constituents of 
essential oils, some of which are of considerable value 
in the flavour and perfume industries* Many terpenoids 
are also biologically active. Some are anti-bacterial and 

10 anti-fungal agents and thus are of great interest to the 
pharmaceutical industry. Indeed, terpenoids are some of 
the highest added value chemicals. 

The terpenoids of commercial interest are not 
normally the terpenes themselves, but rather derivatives 

15 which commonly require stereoselective functionalisation 
at allylic as well as non-activated carbon-hydrogen bonds 
of the parent terpene. This type of chemical 
transformation is one of the most difficult reaction to 
carry out by conventional methods of chemical synthesis - 

20 the highly reactive chemical oxidising agents required 

are non-selective and typically they will preferentially 
attack more activated carbon-hydrogen bonds and reactive 
functional groups such as olefinic double bonds commonly 
present in terpenes . 

25 The present invention concerns the enzymatic 

oxidation of terpenes and cycloalkenes . This technique 
enables the synthesis of hydroxylated terpenes (and 
cycloalkenes) , often in a single step, and provided that 
the match between substrate and enzyme is correct, the 

30 oxidation reaction can be highly chemoselective (attack 
at a particular functional group such as a non-activated 
C-H bond rather than some other reactive functional 
group) and stereoselective. The fine tuning and 
alterations of substrate specificity and selectivity of 

35 substrate oxidation are very difficult to achieve for 
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conventional reagents. 

The present invention provides a process for 
oxidising a substrate which is an acyclic or cyclic 
terpene or a cycloalkene, or a substituted derivative 
5 thereof, which process comprises oxidising said compound 
with a mutant haem-containing enzyme/ the mutant 
comprising the substitution of an amino acid in the 
active site by an amino acid with a less polar side- 
chain. 

10 Although the terpenes used in the present invention 

will generally have the formula (C 5 H 8 ) n where n is 2 or 
more, especially 2, 3 or 4, it is to be understood that 
the term * terpene" extends to compounds which are 
strictly referred to as * terpenoid" , involving the loss 

15 or shift of a fragment, generally a methyl group* Thus, 
for example, sesquiterpenes (where n is 3) which can be 
used in the present invention may contain only, say, 14, 
rather than 15, carbon atoms* Generally the terpene is 
one which can be built up from isoprene units. The 

20 terpene may be cyclic or acyclic. 

The monoterpenes (where n is 2) will generally have 
10 carbon atoms, typically with 1 to 3 double bonds, 
especially 1 or 2 ring double bonds, and typically with 0 
to 2 rings. It is possible for one of the rings to be 

25 formed as a bridge containing, typically 0 or 1 carbon 

atoms. In other words, it can be formed by a direct link 
between 2 carbon atoms of an existing ring or with an 
intermediate methylene group. If the terpene is acyclic 
it will generally contain at least 2 double bonds and 

30 generally 3. 

The sesquiterpenes will normally contain 14 or 15 
carbon atoms, typically with 0 to 2 double bonds and 
typically 1 to 3 rings, with the possibility of fused 
rings and/or bridged rings. 

35 The rings which may be present in the terpenes will 
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typically have from 3 to 9 carbon atoms , more especially 
5 or 6 carbon atoms. Thus, in particular, the terpenes 
will contain a cyclohexane, or cyclohexadiene ring. 

The terpenes will generally contain a total of 3 or 
5 4 exocyclic methyl or methylene groups, for example 2 
methyl groups and 1 methylene group or 3 methyl groups 
for a monoterpene, and 3 methyl groups and 1 methylene 
group or 4 methyl groups for a sesquiterpene. 

The monoterpene is typically a limonene, pinene, 

10 terpinene, sabinene, thujene, mercene, ocimeme, nerol or 
geraniol, for example as shown in Table 1. 

The sesquiterpene is generally formed by a head-to- 
tail arrangement of three isoprene units. The 
sesquiterpene is typically an aromadendrene, 

15 caryophyllene, longifolene, valencene, isobazzanene, 
silphinene, ishwarane, isopatchchoul-3-ene, or 
isosesquicarene, for example as shown in Table 2. 

The diterpene (where n is 4) is typically casbene, 
retinal, abietic acid or a gibberellin. 

20 The cycloalkene generally comprises up to 9 ring 

members, e.g. it is a 5, 6, 7, 8, 9 or more member ed 
ring. The cycloalkene is typically a cyclohexene. 

Substituted derivatives of any of the terpenes or 
cycloalkenes mentioned above may also be used. Typically 

25 1, 2, 3 or more substituents are present. Any 

combination of the following substituents may be present. 
The substituent is typically a halogen atom or an alkyl 
or alkenyl group, which generally has 1 to 6 carbons, the 
substituent optionally being substituted with one or more 

30 halogens. It is generally not phenylcyclohexene . Indeed 
the presence of aromatic, such as phenyl, rings is 
generally avoided for all the substrates used in the 
invention. 

The substituent typically has the formula c n H k X B/ 
35 wherein X is the halogen, n is 1, 2, 3 or more, m is 1, 
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2, 3, 4 or more and k is an integer which has an 
appropriate value so that the valencies of the 
substituent C n H k X m are satisfied. For an alkyl 
substituent k+m - 2n+l. Typically k is 1, 2, 3, 4 or 
5 more, or may be O, i.e. the substituent is a perhaloalkyl 
group. The halogen is typically fluorine, chlorine or 
bromine . 

The substituent may also comprise 1, 2 or more 
oxygen atoms and for example may be an alcohol, aldehyde, 
10 ketone or epoxide group. 
"1 The oxidation causes the formation of a C-0 bond 

ii in the compound, generally as the hydroxide from the 

;1 oxidation of a carbon-hydrogen bond, but an epoxide may 

j be formed from the oxidation of a C=C bond. The 

ilS oxidation may thus introduce a hydroxy, aldehyde, ketone 

or epoxide group. Alternatively the oxidation may cause 
■f the further oxidation of an oxygen containing group, such 

\2 as converting a hydroxy group into an aldehyde or ketone 

• } group. 1, 2 or more carbon atoms may be attacked in the 

120 same substrate molecule. 

The oxidation typically gives rise to 1, 2 or more 
oxidation products. These different products may result 
from different carbon atoms being attacked and/or from 
different degrees of oxidation occurring at a given 
25 carbon atom. 

The oxidation may occur on either a ring carbon 
atom or a substituent carbon atom or both. At least the 
initial oxidation will involve attack of a C-H bond which 
may be activated or non-activated or attack at a carbon- 
30 carbon double bond (typically giving an epoxide) . 

Generally an activated C-H bond is where the carbon atom 
is in a benzylic or allyl position. Aromatic rings and 
olefinic double bonds activate C-H bonds to attack by 
stabilising the radical intermediate or any build-up of 
35 charge generated during the reaction pathway. The 
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carbon of the C-H bond may be a primary, secondary or 
tertiary carbon. 

The oxidation typically preserves stereoisomerism. 
Thus when the substrate consists of a single stereoisomer 
5 the product typically consists of a single corresponding 
stereoisomer, or can contain a preponderance of the 
corresponding stereoisomer. 

The enzyme used in the process is generally a P450 
enzyme, typically of eukaryotic or prokaryotic origin* 
10 The enzyme is generally of bacterial, fungal, yeast, 
% plant or animal origin, and thus may be from a bacterium 

03 0 f the genus Pseudomonas. The enzyme is typically a 

U monooxygenase. The non-mutant form of the enzyme may or 

W may not be able to oxidize terpenes and/or cycloalkenes . 

^1 5 The mutations discussed herein are generally 

*_ introduced into the enzyme by using methods known in the 

P art, such as site directed mutagenesis of the enzyme, PCR 

\t and gene shuffling methods or by the use of multiple 

a mutagenic oligonucleotides in cycles of site-directed 

U 20 mutagenesis. Thus the mutations may be introduced in a 
directed or random manner. The mutagenesis method thus 
produces one or more polynucleotides encoding one or more 
different mutants. Typically a library of mutant 
oligonucleotides is produced which can be used to produce 
25 a library of mutant enzymes. 

An amino acid 'in the active site' is one which 
lines or defines the site in which the substrate is bound 
during catalysis or one which lines or defines a site 
through which the substrate must pass before reaching the 
30 catalytic site. Therefore such an amino acid typically 
interacts with the substrate during entry to the 
catalytic site or during catalysis. Such an interaction 
typically occurs through an electrostatic interaction 
(between charged or polar groups), hydrophobic 
35 interaction, hydrogen bonding or van der Waals forces. 
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The amino acids in the active site can be 
identified by routine methods to those skilled in the 
art. These methods include labelling studies in which the 
enzyme is allowed to bind a substrate which modifies 
( x labels ' ) amino acids which contact the substrate. 
Alternatively the crystal structure of the enzyme with 
bound substrate can be obtained in order to deduce the 
amino acids in the active site. 

The enzyme may have 1, 2, 3, 4, 5 to 10, 10 to 20 
or more other mutations, such as substitutions, 
insertions or deletions- The other mutations may be in 
the active site or outside the active site. Typically 
the mutations are in the "second sphere' residues which 
affect or contact the position or orientation of one or 
more of the amino acids in the active site. The 
insertion is typically at the N and/or C terminal and 
thus the enzyme may be part of a fusion protein. The 
deletion typically comprises the deletion of amino acids 
which are not involved in catalysis, such as those 
outside the active site (thus the enzyme is a mutated 
fragment of a naturally occurring enzyme) . The enzyme may 
thus comprise only those amino acids which are required 
for oxidation activity. 

The other mutation in the active site typically 
alters the position and/or conformation of the substrate 
when it is bound in the active site. The mutation may 
make the site on the substrate which is to be oxidized 
more accessible to the haem group. Thus the mutation may 
be a substitution to an amino acid which has a smaller or 
larger, or more or less polar, side chain. 

The other mutations typically increase the 
stability of the protein, or make it easier to purify the 
protein. They typically prevent the dimerisation of the 
protein, typically by removing cysteine residues from the 
protein {e.g. by substitution of cysteine at position 334 
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of P450 calri/ or at an equivalent position in a homologue, 
preferably to alanine) . They typically allow the protein 
to be prepared in soluble form, for example by the 
introduction of deletions or a poly-histidine tag, or by 
5 mutation of the N-terminal membrane anchoring sequence ♦ 
The mutations typically inhibit protein oligomerisation, 
such as oligomerisation arising from contacts between 
hydrophobic patches on protein surfaces. 

Thus the mutant enzyme is typically at least 70% 

10 homologous to a naturally occurring haem-containing 
enzyme on the basis of amino acid identity. 

Any of the homologous proteins (i.e. described as 
being homologous to another protein) mentioned herein are 
typically at least 70% homologous to the relevant protein 

15 or at least 80 or 90% and more preferably at least 95%, 
97% or 99% homologous thereto over at least 20, 
preferably at least 30, for instance at least 40, 60 or 
100 or more contiguous amino acids. The contiguous amino 
acids may include the active site. This homology may 

20 alternatively be measured not over contiguous amino acids 
but over only the amino acids in the active site. 

Homology can be measured using known methods. For 
example the UWGCG Package provides the BESTFIT program 
which can be used to calculate homology {for example used 

25 on its default settings) {Devereux et al (1984) Nucleic 
Acids Research 12, p387-395) . The PILEUP and BLAST 
algorithms can be used to calculate homology or line up 
sequences (typically on their default settings), for 
example as described in Altschul S. F. (1993) J Mol Evol 

30 36:290-300; Altschul, 5, F et al (1990) J Mol Biol 
215:403-10. 

Software for performing BLAST analyses is publicly 
available through the National Center for Biotechnology 
Information ( http : / /www . ncbi . nlm . nih . gov/ ) . This 
35 algorithm involves first identifying high scoring 
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sequence pair (HSPs) by identifying short words of length 
W in the query sequence that either match or satisfy some 
positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is 
5 referred to as the neighbourhood word score threshold 
(Altschul et al, supra) . These initial neighbourhood 
word hits act as seeds for initiating searches to find 
HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the 

10 cumulative alignment score can be increased. Extensions 
for the word hits in each direction are halted when: the 
cumulative alignment score falls off by the quantity X 
from its maximum achieved value; the cumulative score 
goes to zero or below, due to the accumulation of one or 

15 more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm 
parameters W, T and X determine the sensitivity and speed 
of the alignment. The BLAST program uses as defaults a 
word length (W) of 11, the BLOSUM62 scoring matrix {see 

20 Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 

89: 10915-10919) alignments (B) of 50, expectation (E) of 
10, M=5, N=4, and a comparison of both strands. 

The BLAST algorithm performs a statistical analysis 
of the similarity between two sequences; see e.g., Karlin 

25 and Altschul (1993) Proc. Natl. Acad. Sci. USA 90: 5873- 
5787. One measure of similarity provided by the BLAST 
algorithm is the smallest sum probability (P(N)), which 
provides an indication of the probability by which a 
match between two nucleotide or amino acid sequences 

30 would occur by chance. For example, a sequence is 

considered similar to another sequence if the smallest 
sum probability in comparison of the first sequence to 
the second sequence is less than about 1, preferably less 
than about 0.1, more preferably less than about 0.01, and 

35 most preferably less than about 0.001. 
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Typically the homologous protein differs from the 
relevant protein by at least 1, 2, 5 or 10 mutations 
(substitutions, insertions or deletions) when compared to 
all of the protein or over any of the lengths of 
5 contiguous amino acids mentioned above. 

The enzyme used in the process is preferably a 
mutant of P450 cam (such as mutant of the sequence shown in 
table 7) or a mutant of a naturally occurring homologue 
of P450 cam , typically of P450 BM _ 3 from Bacillus megaterium 
10 (such as a mutant of the sequence shown in table 8), 
O P450 terp from Pseudomonas sp, and P450 eryF from 

Saccharopollyspora erythraea, and also P4 50 105 Dl 
:;i (CYP105) from Streptomyces griseus strains. Note that 

■J the amino acid numbering shown in table 8 for P450 BM _ 3 

-;J.5 does not correspond to the numbering used in the 

description to denote mutations in this enzyme* The 
sequence shown in table 8 contains an additional amino 
H= acid at the N terminal. This is normally cleaved in 

;:: x^ivo. Therefore each amino acid number shown in the 

-20 table is always one more than the number used in the 
conventional numbering (as used in the description) . 

The naturally occurring homologue of P450 cam (e.g. 
of P450 BM „ 3 ) may have substantially the same activity as 
P450 cam or P450 BM _ 3 . The homologue may be a species 
25 homologue or an allelic variant of P450 cam from 

Pseudomonas putida or of P450 BM _ 3 . The amino acids in the 
active site of the homologue may be the same as in the 
active site of P450 carct or of P450 BM _ 3 . Typically the amino 
acid at the equivalent position to 96 in P450 cam is a 
30 tyrosine in the homologue. 

The mutant of P450 cam or of a homologue of P450 cam is 
typically one in which amino acid 96, or the equivalent 
amino acid in a homologue, has been changed to an amino 
acid with a less polar side chain. In the case where the 
35 homologue is P450 BM _ 3 the mutant typically has a 
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substitution (to a less polar amino acid) at 47 and/or 51 
and/or 42 and/or 75 and/or 354 and/or 264 and/or 263 
and/or 181 and typically does not have a mutation at the 
equivalent site to 96 of P450 cam (preferred mutants of 
5 P450 BM _ 3 have at least mutations at 47 and 51, or at the 
equivalent sites in homologues) . 

Thus typically the substitution is to an amino acid 
which is above the original amino acid in Table 3, such 
as the preferred mutations shown in Table 4 and Table 5. 

10 The ^equivalent' side chain in the homologue is one 

at the homologous position. This can be deduced by lining 
up the P450 cam or P450 BM _ 3 sequence and the sequence of the 
homologue based on the homology between the two 
sequences. The PILEUP and BLAST algorithms can be used to 

15 line up the sequences. The equivalent amino acid will 

generally be in a similar place in the active site of the 
homologue as any of the specific amino acids discussed 
herein, such as amino acid 96 in P450 cam . 

The discussion below provides examples of the 

20 positions at which substitutions may be made in P450 cam 
and P450 BM _ 3 . The same substitutions may be made at 
equivalent positions in the homologues. Standard 
nomenclature is used to denote the mutations. The letter 
of the amino acid present in the natural form is followed 

25 by the position, followed by the amino acid in the mutant 
(these positions can be correlated to the numbering shown 
in tables 7 and 8 with the proviso discussed above with 
regard to table 8 amino acid numbering) . To denote 
multiple mutations in the same protein each mutation is 

30 listed separated by hyphens. The mutations discussed 

below using this nomenclature specify the natural amino 
acid in P450 cam or P450 BM _ 3 but it is to be understood that 
the (same) mutation could be made to a homologue which 
has a different amino acid at the equivalent position. 

35 An additional mutation is typically an amino acid 
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substitution at amino acid 87, 98, 101, 185, 244, 247, 
248, 296, 395, 396 of P450 cam (or a combination of these, 
for example as shown in table 4) . 

The following combinations of substitutions are 
5 preferred for P450 cam : 

(i) Substitution at position 87 to amino acids of 
different side-chain volume, such as substitutions 
(typically of F) to A, L, I and W, combined with 
substitutions at position 96 to amino acids of different 
10 side-chain volume such as {typically Y to) A, L, F, and 
p W. These combinations alter the space available in the 

i; upper part of the substrate pocket compared to the wild- 

H type enzyme, for example, from Y96W-F87W (little space) 

7j to Y96A-F87A (more space) , as well as the location of the 

-JL5 space, for example from one side in Y96F-F87A to the 

other in Y96A-F87W. 
\i (ii) Substitution at position 96 to F combined with 

fl substitutions at positions 185 and 395. Both T185 and 

Jj 1395 are at the upper part of the substrate pocket, and 

J2Q substitution with A creates more space while substitution 
with F will reduce the space available and push the 
substrate close to the haem. 

(iii) Substitutions at position 96 to A, L, F, and W 
combined with substitutions at residues closer to the 

25 haem including at 101, 244, 247, 295, 296 and 396 to A, 
L, F, or W. These combinations will create or reduce 
space in the region of the different side-chains to offer 
different binding orientations to substrates of different 
sizes. For example, the combinations Y96W-L244A and 

30 Y96L-V247W will offer very different substrate pockets 
for the binding of R-limonene. 

(iv) Triple substitutions at combinations of positions 
87, 96, 244, 247, 295, 296, 395 and 396 with combinations 
of A, L, F, and W. The aim is to vary the size and shape 

35 of the hydrophobic substrate binding pocket. For 
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example, the Y96A-F87A-L244A combination creates more 
space compared to the Y96F-F87W-V396L combination, thus 
allowing larger terpenes to bind to the former while 
restricting the available binding orientations of smaller 
5 terpenes in the latter. The combinations Y96F-F87W-V247L 
and Y96F-F87W-V295I have comparable substrate pocket 
volumes, but the locations of the space available for 
substrate binding are very different. The combination 
Y96F-F87L-V247A has a slightly larger side-chain volume 

10 at the 96 position than the combination Y96L-F87L-V247A, 
but the L side-chain at the 96 position is much more 
flexible and the substrate binding orientations will be 
different for the two triple mutants, 
(v) The mutants with four or five substitutions were 

15 designed with similar principles of manipulating the 

substrate volume, the different flexibility of various 
side-chains, and the location of the space available in 
the substrate pocket for terpene binding so as to effect 
changes in selectivity of substrate oxidation. 

20 The invention also provides the mutant of P450 cam or 

a mutant of a homologue of P450 carft {such as P450 BM _ 3 ) as 
discussed above, excluding mutants of P450 cam which only 
have the mutations : 

F87A-Y96G-F193A, F87A-Y96G-F193A-C334A or T101M-T185F- 
25 V247M. 

The mutant enzyme may be in a substantially 
isolated form and/or a substantially purified form, in 
which case it will generally comprise (e.g. about or at 
least) 90%, such as (e.g. about or at least) 95%, 98% or 

30 99% of the protein in the preparation. 

The invention also provides a polynucleotide which 
comprises a sequence which encodes the mutant enzyme of 
the invention. The polynucleotide is typically DNA or 
RNA, and may be single or double stranded. The 

35 polynucleotide may be able to hybridise with a 
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polynucleotide encoding the naturally occurring form of 
any mutant discussed herein (each the polynucleotide 
shown in table 7 or 8) . It typically hybridises with the 
relevant polynucleotide at a level significantly above 
5 background. The signal level generated by the 

interaction is typically at least 10 fold, preferably at 
least 100 fold, as intense as ^background' hybridisation. 
The intensity of interaction may be measured, for 
example, by radiolabelling the probe, e.g. with 32 P. 

10 Selective hybridisation is typically achieved using 
conditions of medium to high stringency (for example 
0.03M sodium chloride and 0.03M sodium citrate at from 
about 50°C to about 60°C) . 

Polynucleotides of the invention can be 

15 incorporated into a recombinant replicable vector. The 
vector may be used to replicate the nucleic acid in a 
compatible host cell. Thus one method of making 
polynucleotides of the invention comprises introducing a 
polynucleotide of the invention into a replicable vector, 

20 introducing the vector into a compatible host cell, and 
growing the host cell under conditions which bring about 
replication of the vector. The vector may be recovered 
from the host cell. Suitable host cells are described 
below in connection with expression vectors. 

25 Preferably, a polynucleotide of the invention in a 

vector is operably linked to a control sequence which is 
capable of providing for the expression of the coding 
sequence by the host cell, i.e. the vector is an 
expression vector. 

30 The term "operably linked" refers to a 

juxtaposition wherein the components described are in a 
relationship permitting them to function in their 
intended manner. A control sequence "operably linked" to 
a coding sequence is ligated in such a way that 

35 expression of the coding sequence is achieved under 
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conditions compatible with the control sequences. 

Such vectors may be transformed into a suitable 
host cell to provide for expression of the mutant enzyme. 
The vectors may be, for example, plasmid, virus or 
5 phage vectors provided with an origin of replication, 
optionally a promoter for the expression of the said 
polynucleotide and optionally a regulator of the 
promoter. The vectors may contain one or more selectable 
marker genes, for example an ampicillin resistance gene 
10 in the case of a bacterial plasmid or a neomycin 
;.J resistance gene for a mammalian vector. Vectors may be 

used in vitro, for example for the production of RNA or 
V* used to transfect or transform a host ceil. 

Promoters and other expression regulation signals 
;,4L5 may be selected to be compatible with the host cell for 
w which the expression vector is designed. For example, 

"3 E.Coli promoters include lac, tac, trc, trp and T7 

;I promoters, and yeast promoters include S. cerevisiae GAL4 

-J; and ADH promoters, 5. pomJbe nmtl and adh promoters. 

{£20 Mammalian promoters include the met alio thionein promoter 
which can be induced in response to heavy metals such as 
cadmium. The expression vectors are possible for use in 
insect or mammalian cells. For use in insect cells, 
strong baculovirus promoters such as the polyhedrin 
25 promoter are preferred. For expression in mammalian 

cells, strong viral promoters such as the SV40 large T 
antigen promoter, a CMV promoter or an adenovirus 
promoter may also be used. All these promoters are 
readily available in the art. 
30 Expression vectors of the invention are typically 

introduced into host cells using conventional techniques 
including calcium phosphate precipitation, DEAE-dextran 
transf ection, or electroporation. 

The expression vector may contain a selectable 
35 marker and/or such a selectable marker may be co- 
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transfected with the expression vector and stable 
transfected cells may be selected. 

Suitable cells include cells in which the 
abovementioned vectors may be expressed. Such cells may 
be prokaryotic or eukaryotic. These include microbial 
cells typically bacteria such as E. coli, preferably the 
strains DH 5cf , JM109, NM522 and BL21DE3 or Pseudomonas, 
typically putida, mammalian cells such as CHO cells, COS7 
cells or HeLa cells, insect cells or yeast such as 
Saccharomyces. Baculovirus or vaccinia expression 
systems may be used. 

Cell culture can take place under standard 
conditions. Generally the cells are cultured in the 
presence of assimible carbon and nitrogen sources. 
Commercially available culture media for cell culture are 
widely available and can be used in accordance with 
manufacturers instructions. 

Typically the process of the invention is carried 
out in vitro, such as in a cell free system. The process 
may be carried out in vivo in a cell. 

Typically, in addition to the enzyme (a) and the 
substrate the process of the invention is carried out in 
the presence of an electron transfer reductase (b) , an 
electron transfer redoxin (c) , cofactor for the enzyme 
and an oxygen donor. In this system the flow of 
electrons is typically: cofactor -> (b) -> (c) -> (a) . 
However particular enzymes do not require the presence of 
an electron reductase and electron transfer redoxin, such 
as P4 50 BM „ 3 , Although the following discussion is 
particularly directed to enzymes which do require a 
reductase or redoxin, it is applicable to enzymes which 
do not require these (for example the various 
concentrations, conditions and rates are suitable for 
these enzymes) . 

For enzymes which do require a reductase and 
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redoxin (b) is generally an electron transfer reductase 
which is able to mediate the transfer of electrons from 
the cofactor to (c) , such as a naturally occurring 
reductase or a protein which has homology with a 
5 naturally occurring reductase, such as at least 70% 

homology; or a fragment of the reductase or homologue. 
Thus (b) may be derived from any of the organisms listed 
above from which the haem-containing enzyme may be 
derived, (b) is typically a flavin dependent reductase, 
10 such as putidaredoxin reductase. 
f1 (c) is generally an electron transfer redoxin which 

•4i is able to mediate the transfer of electrons from the 

^ cofactor to (a) via (b) . (c) is typically a naturally 

5" 1 occurring electron transfer redoxin or a protein which 

r "}15 has homology with a naturally occurring electron transfer 
S redoxin, such as at least 70% homology; or a fragment of 

the redoxin or homologue. Thus (c) may be derived from 
£5 any of the organisms listed above from which the haem- 

containing enzyme may be derived, (c) is typically a two- 
-20 iron/ two sulphur redoxin, such as putidaredoxin. 

The cofactor is any compound capable of donating an 
electron to (b) , such as NADH. The oxygen donor is any 
compound capable of donating oxygen to (a) , such as 
dioxygen. 

25 In the process the concentration of (a) , (b) or (c) 

is typically from 10' 8 to 10' 2 M, preferably from 10" 6 to 
10~ 4 M. Typically the ratio of concentrations of (a) : (b) 
and/or (a): (c) is from 0.1:01 to 1:10, preferably from 
1:0.5 to 1:2, or from 1:0.8 to 1:1.2. Generally the 

30 process is carried out at a temperature and/or pH at 

which the enzyme is functional, such as when the enzyme 
has at least 20%, 50%, 80% or more of peak activity. 
Typically the pH is from 3 to 11, such as 5 to 9 or 6 to 
8, preferably 7 to 7.8 or 7.4. Typically the temperature 

35 is 15 to 90°C, such as 25 to 75°C or 30 to 60°C. In one 
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following: 

H 2 N~P450 cam -TDGTSST-putidaredoxin reductase-TDGASSS- 
put i dar edoxin-COOH , 

H 2 N-P450 cani -TDGTRPGPGPGPGPSST-putidaredoxin 
5 reductase-TDGASSS-putidaredoxin-COOH, 

H 2 N-P450 cam -TDGTRPGPGPGPGPGPGPSST-putidaredoxin 
reductase-TDGASSS-putidaredoxin-COOH, 

H 2 N-putidaredoxin reductases -TDGASSS- 
putidaredoxin-PLEL-P450 cam -COOH. 
10 However it is understood that the excluded E.Coll 

DH5a cells can be used to produce the libraries discussed 
below. 

A preferred cell (second type) is a cell which 
expresses : 

15 (a) (i) P450 BM _ 3/ or a fragment thereof; or 

(ii) a naturally occurring homologue of 
P450 BM _ 3 or a fragment thereof; or 

(iii) a mutant P450 BM _ 3 , or a mutant 

homologue of thereof. 

20 The cell provided by the invention is typically a 

cell from the species mentioned above in which the 
nucleotide of the invention can be expressed. The cell 
may be a mutator cell. Such a cell is generally 
deficient in one or more of the primary DNA repair 

25 pathways (such as E.Coli pathways mutS, mutD or mutT, or 
their equivalents in another organism) , and thus has a 
high mutation rate. Simply culturing such cell leads to 
the DNA encoding (a) to become mutated. The cell may be 
of E.Coli XL1 Red mutator strain* 

30 The cell of the invention may be in a substantially 

isolated form and/or substantially purified form, in 
which case it will generally comprise (e.g. at least or 
about) 90%, such as (e.g. at least or about) 95%, 98% or 
99% of the cells or the mass (normally measured in terms 

35 of dry mass) of the preparation. 
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embodiment the process is carried out in the presence of 
a substance able to remove hydrogen peroxide by-product 
(e.g. a catalase) . 

Alternatively the process of the invention could be 
5 carried out in the presence of the enzyme, substrate and 
an oxygen atom donor, such as hydrogen peroxide or t- 
butylhydroperoxide . Thus, the process could be carried 
out using the peroxide shunt. 

Typically in the process at least 20 turnover s/min 
10 occur, such as at least 50, 100, 200, 300, 500 or more 
J; turnovers (turnover is measured as nanomoles of product 

formed per nanomole of enzyme) . 
;f1 The invention also provides several types of cells. 

The first type expresses : 
115 an enzyme which can be used in the process which in 

f" its naturally occurring form has an electron transfer 
VV reductase domain; or expresses 

{t (a) the mutant haem-containing enzyme which is used in 

VI the process of the invention; 

20 (b) an electron transfer reductase; and 
(c) an electron transfer redoxin. 

The second type of cell expresses: 

(a) (i) P450 caa or a fragment thereof; or 

25 (ii) a naturally occurring homologue of P450 cam or 

a fragment thereof; or 
(iii) a mutant of P450 cam ; or 

(iii) a polypeptide which has at least 70% amino 

acid homology with (i) or (ii) and optionally 
30 has any of the combination of mutations 

discussed herein; and 

(b) an electron transfer reductase; and 

(c) an electron transfer redoxin: 

excluding an E.Coli DH5a cell in which the only 
35 mutants of P450 cam which are expressed are amongst the 
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The cell may be one which does not naturally 
express {a), (b) or (c) . The cell be one in which (a), 
(b) or (c) are expressed at a higher level than in the 
naturally occurring cell* (a) may originate from the 
5 same organism as (b) or (c) . 

In the cell (a) , (b) and (c) may be expressed from 
the same vector, or may be expressed from different 
vectors. They may be expressed as three different 
polypeptides* Alternatively they may be expressed in the 
10 form of fusion proteins. Typically components (a), (b) 
^\ and (c) are all present in the same fusion protein. 

Alternatively only two of the components, preferably (b) 
*P and (c) , may be present in the fusion protein* Typically 

;=r the components are contiguous in the fusion protein and 

-15 there is no linker peptide present. 

\ Alternatively a linker may be present between the 

5^ components. The linker generally comprises amino acids 

.[ that do not have bulky side chains and therefore do not 

^ obstruct the folding of the protein subunits. Preferably 

■'20 the amino acids in the linker are uncharged. Preferred 

amino acids in the linker are glycine, serine, alanine or 
threonine. In one embodiment the linker comprises the 
sequence N-Thr-Asp-Gly-Gly-Ser-Ser-Ser-C. The linker is 
typically from at least 5 amino acids long, such as at 
25 least 10, 30 or 50 or more amino acids long. 

The first type of cell may be obtained by 
transforming or transfecting a host cell with a 
polynucleotide or vector of the invention. 

The mutant enzyme of the invention may be prepared 
30 by a process comprising cultivating the first type of 
cell under conditions to provide for expression of the 
said mutant enzyme, and optionally recovering the 
expressed mutant enzyme. 

The process of the invention may be carried out in 
35 the first type of cell of the invention or in the second 
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type of cell if it is able to oxidize the substrate, or a 
medium containing it. Generally such a process comprises 
providing the substrate in the cell, allowing the 
substrate to be oxidized in the process of the invention, 
5 and optionally obtaining the oxidation product therefrom, 
e.g. by extraction. The substrate is typically provided 
in the cell by adding the substrate to the outside of the 
cell and allowing it to enter the cell. Alternatively the 
substrate could be synthesized in the cell from a 
10 precursor. 

The invention also provides a process for making a 
5 library of mutants of P450 cain or mutants of a homologue of 

:,: P450 cam comprising contacting the second type of cell with 

\:J a mutagen and/or when the cell is a mutator cell 

Tj5 culturing the cell in conditions in which mutants are 

produced. The mutagen may be contacted with the cell 
: ;j prior to or during culturing of the cell. Thus the 

mutagen may be present during replication of the cell or 
q replication of the genome of the cell. 

^° The mutagen generally causes random mutations in 

the polynucleotide sequence which encodes (a) . The 
mutagen is typically a chemical mutagen, such as 
nitrosomethyguanidine, methyl- or ethylmethane sulphonic 
acid, nitrite, hydroxylamine, DNA base analogues, and 

25 acridine dyes, such as proflavin. It is typically 

electromagnetic radiation, such as ultra-violet radiation 
at 2 60 nm (absorption maximum of DNA) and X-rays. It is 
typically ionising radiation. 

Typically the library will be in the form of cells 

30 which are derived from cells of the invention by 

mutagenesis and which cells comprise the mutant enzymes. 
Generally each cell will express only one particular 
mutant enzyme. The library typically comprises at least 
500 mutants, such as at least 1,000 or 5,000 mutants, 

35 preferably at least 10,000 different mutants. 
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The library typically comprises a random population 
of mutants. The library may undergo one or more rounds 
of selection whilst being produced and therefore may not 
comprise a random population. Between rounds of 
5 selection the cells in the library may be allowed to 

replicate, and they may also be contacted with a mutagen ♦ 

A mutant can be selected from the library based on 
a particular property of the mutant. The property may 
comprise one or more of the following characteristics: 
10 (i) the ability to oxidize a particular substrate; 
; f optionally to a particular oxidation product or to a 

fri product with a particular activity. 

(ii) the ability to carry out the oxidation of substrate 
u j at an increased rate, 

Id5 (iii) a reduced oxidation activity towards a particular 
s substrate, 

rf {iv) a reduction in the production of a particular 

substrate. 

tl Typically the activity of the product in (i) is 

gO blocking the action of an agent which is lethal to the 
cells of the library. This can be selected by growing 
the library in the presence of the agent. The agent is 
typically expressed within the cells of the library. 

The activity may be the binding of the product to a 

25 particular substance, such a protein. The substance is 
typically present in the cell of the library, and/or is 
typically a disease-causing or therapeutic target. An 
indicator which binds the substance is typically used to 
detect binding of the product to the substance. In one 

30 embodiment the indicator is able to bind the substance 
and has a property which changes upon binding, e.g. a 
colour change. Product which displaces the indicator 
from the substance can thus be detected by measuring 
changed in the property. 

35 The invention also provides a method of making a 
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library of oxidation products comprising providing a 
substrate to the library of mutant enzymes and allowing 
oxidation of the substrate. 

Products produced in the process of the invention, 
5 or identified, selected, made or designed using the 

library could be used in therapy or diagnosis. Thus the 
invention provides a method of treating a host suffering 
from a disease, which method comprises administering to 
the host a therapeutically effective amount of the 
10 product. The condition of a patient suffering from the 
■ : f disease and in need of the product can therefore be 
7j improved by administration of the product. The product 

can also be given as a prophylactic, typically to a host 
: ; j which is at risk from or susceptible to the disease* 
15 The invention provides the product for use in a 

, : method of treatment of the human or animal body by 
j.:;; therapy. The invention also provides the product for use 

in a diagnostic method practiced on the human or animal 
;J body. The invention also provides use of the product in 
26 the manufacture of a medicament to treat a disease. 

The formulation of the product for use in 
preventing or treating infection by an organism will 
depend upon factors such as the nature of the product 
identified, whether a pharmaceutical or veterinary use is 
25 intended, etc. In order to be administered to a patient, 
the compound will be provided in the form of a 
pharmaceutical composition containing the product and a 
pharmaceutically acceptable carrier or diluent. Suitable 
carriers and diluents include isotonic saline solutions, 
30 for example phosphate-buffered saline. Typical oral 
dosage compositions include tablets, capsules, liquid 
solutions and liquid suspensions. For example it may be 
formulated for parenteral, intravenous, intramuscular, 
subcutaneous, transdermal or oral administration. 
35 The dose of product may be determined according to 
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various parameters, especially according to the substance 
used; the age, weight and condition of the patient to be 
treated; the route of administration; and the required 
regimen * A physician will be able to determine the 
5 required route of administration and dosage for any 

particular patient. A suitable dose may however be from 
0.1 to 100 mg/kg body weight such as 1 to 40 mg/kg body 
weight, for example, to be taken from 1 to 3 times daily. 
The invention is illustrated by the accompanying 

10 drawings which show the gas chromatography results for 
various oxidation reactions . In the drawings unless 

:.: stated otherwise the y-axis shows mVolts and the x-axis 

shows Time (in minutes) . 

■ y j In the drawings : 

%5 Figure 1 shows the oxidation of camphor by the 

C334A mutant of P450 cam (expressed from plasmid SGB++) . 

S Lines A, B, C, D and E represent camphor turnover at 2, 
10, 20, 40 and 100 minutes. The 5.28 peaks is camphor, 

;1; 11.82 is 5-exo-hydroxycamphor, 7.45 is 5-ketocamphor and 

BO the 16.07 peak is the internal standard. 

Figure 2 shows R- and S- limonene with wild type 
P450 BM - 3 . 

Figure 3 shows NADH consumption by wild type and 
mutant P450 BM _ 3 . 

25 Figure 4 shows the whole cell E. coli oxidation of 

a-pinene by the Y96F-F87W-V247L mutant expressed by 
plasmid pCWSGB+. 

The invention is also illustrated by the Examples; 
Example 1 

30 Expression of mutants for in vitro work. 

The P4 50 cam enzymes were expressed using the vector 
pRH1091 {Baldwin, J.E., Blackburn, J.M., Heath, R.J., and 
Sutherland, J.D. Bioorg. Med. Chem. Letts., 1992, 2, 663- 
668.) which utilised the trc promoter (a fusion of the 
35 trp and lac promoters) . This vector incorporates a strong 
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ribosome binding site (RBS) and the gene to be expressed 
is cloned using an Nde I site on the 5 T end of the gene. 
We used Hind III as the cloning site at the 3 1 end of the 
camC gene. The procedure for protein expression is as 
5 follows: Cells are grown at 30°C until the OD 600nm reaches 
1.0 - 1.2/ the temperature is increased to 37°C and 
camphor added as a 1 M stock in ethanol to a final 
concentration of 1 mM. The culture is allowed to incubate 
at 37°C for another 6 hours. The P450 cain protein is 
10 expressed to high levels in the cytoplasm and the cells 
take on a red to orange-red colour. 

We have also prepared a variant of pRH1091 (by PCR) 
which has a extra Xba I site between the RBS and the Nde 
;-\ I site. This is important because Nde I is not unique in 

:_^15 M13, and this restriction site is also present in the 

reductase gene as well as the backbone of the pGLWll 
U vector used for the in vivo system. Xba I is unique in 

\~ the polyiinker region of M13, but absent in the genes of 

:J all three proteins in the P450 cata system and in the 

20 expression vectors. It therefore allows the camC gene to 
be moved between the mutagenic and expression vectors. 

The P450 BM „ 3 enzyme from Bacillus megaterium was 
expressed using either the pGLWll or pCW vectors. The 
recombinant plasmid with the P450 BM _ 3 gene inserted into 
25 either of these vectors were transformed into E. coli 
strain DH5a and grown under ampicillin selection. A 
single colony was then grown at 30°C in LB media 
supplemented with ampicillin until the OD at 600 nm 
reached ca. 1, and protein expression was induced by 
30 adding IPTG from a 1 M stock to a final concentration of 
ImM. After 6-8 h, cells were harvested by centrifugation 
and the expression levels were high, as indicated by an 
orange-red coloration of the cells. 
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How the mutants were made. 

Oligonucleotide-directed site-specific mutagenesis 
was carried out by the Kunkel method (Kunkel, T. A. Proc. 
Natl. Acad. Sci . USA 1985, 82, 488-492) using the Bio-Rad 
5 Mutagen kit and by PCR using the QuikChange kit from 
Stratagene. The recommended procedure for the Kunkel 
method is summarised as follows. An M13 mp!9 subclone of 
the camC gene encoding P450 cam was propagated in the E. 
coli strain CJ236. This strain has the du t~ung~ phenotype 
10 and thus will tolerate the inclusion of uracil in place 
.3 of thymine in DNA molecules. After three cycles of 

^ infection, uracil-containing single stranded (USS) M13 
:jl DNA was readily isolated by phenol extraction of mature 
^ M13 phage particles excreted into the growth medium. The 
i5 mutagenic oligonucloetide (or oligonucleotides) were 
^ phosphorylated with T4 polynucleotide kinase and then 
□ annealed to the USS template. The four nucleotides, DNA 
^ polymerase, DNA ligase, ATP and other chemical components 
Cl were added and the second strand was synthesised in 
20 vitro. The double stranded form thus obtained was 

transformed into the dut+ ung+ E. coli strain MV1190, 
which should degrade the uracil-containing template 
strand and propagate the mutant strand synthesised in 
vitro. Plaques were picked and phages of possible mutants 
25 grown in E. coli strains MV1190 or TGI. The single- 
stranded DNA from these were sequenced to determine 
whether the mutagenesis reaction was successful. The 
mutagenic efficiency was 50 - 80%. The mutant camC gene 
is excised from the M13 subclone by restriction digest 
30 with Nde I and Hind III, and the fragment of appropriate 
size is ligated to the backbone of the expression vector 
prepared by a similar Nde I/Hind III digest. 

The QuikChange kit relies on the property of the Dpn 
I restriction enzyme which selectively cleaves methylated 
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DNA. The mutation is introduced by PCT using double 
stranded plasmid DNA, and hence no single stranded 
template preparations are necessary. The PCR reaction is 
carried out with two oligonucleotides, one of which binds 
5 to the coding strand and the other to the sense strand. 
Each oligonucleotide contains a short stretch of 
polynucleotide complementary to either side of the 
mutation site. After in vitro synthesis by PCR using 
non-methylated dNTP's, plasmid DNA with overlapping nicks 
10 in each strand were digested with Dpn I to remove the 
S starting template selectively - plasmid DNA isolated from 
7q most E. coli strains contain methylated bases but the 
~2 newly synthesised DNA do not have methylated bases. 
:~j After the digest the DNA is transformed into 
16 supercompetent E. coli XL1 Blue cells and propagated. 

The plasmid DNA from potential mutants which grow on agar 
y plates under antibiotic selection were isolated and 
;u sequenced to confirm mutagenesis. The cells can then be 
; : J used for protein expression once the entire sequence of 
£0 the new mutant was confirmed to ensure that there were no 
spurious mutations. 

Multiple mutants were prepared either by further 
mutagenesis, also by the Kunkel method, or where the 
location of the sites in the sequence permits, simple 
25 cloning steps. There are two unique restriction sites 

within the camC gene which are absent from the expression 
vector. One is Sph I which spans residues 121 - 123, and 
the other is Sal I which spans residues 338 and 339. 
Therefore, all mutations at, for example, residues 87, 
30 96, 98, and 101 are readily combined with mutations at 

higher number residues by ligating appropriate fragments 
from restriction digests of mutant camC genes with Nde 
1/Sph I and Sph I /Hind III and the backbone fragment from 
a Nde 1/ Sph I digest of the expression vector. Mutations 
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10 



at, for example/ 395 and 396 can be similarly 
incorporated by digests in which Sph I is replaced with 
Sal I. 

The rationale for introducing the unique Xba I site 
is now clear: many mutants with multiple mutations were 
prepared by the cloning procedure above. Without the Xba 
I site it would be impossible to clone the gene for these 
multiple mutants from the expression vector back into M13 
for further rounds of mutagenesis. Of course these 
problems could be overcome by doing mutagenesis by PCR, 
for example. 



1'S 



2.a 



Example 2 

Substrate oxidation "protocol: In vitro reactions 



Component 
P450 cam enzyme 
Putidaredoxin 
Putidaredoxin reductase 
Bovine liver catalase 
KC1 

Substrate 
NADH 



Final concentration 
1 pM 
10 pM 
1 pM 

20 pg/ml 
200 mM 

Typically 1 mM 
250 - 400 pM 



* 50 mM Tris-HCl buffer pH 7.4 is added to make up 
25 the volume* 

* Temperature controlled at 30°C, optional. 

* The NADH turnover rate could be determined by 
monitoring the absorbance at 34 0 nm with time. 

* Catalase does not catalyse the substrate 

30 oxidation reactions but rather it is present to remove 
any hydrogen peroxide by-product which could otherwise 
denature the P4 50 caIn . 

The method can be increased in scale to, for 
example, 20 ml total incubation volume to allow 

35 purification of sufficient products by HPLC for 
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spectroscopic characterisation. Fresh substrate (1 mM) 
and NADH (1-2 mM) are added periodically, such as every 
20 minutes in a total reaction time of, typically, 3 
hours . 

5 

Example 3 

The in vivo system 

The in vivo systems were expressed using the vector 
pGLWll, a derivative of the plasmid pKK223 (Brosius, J. 
10 and Holy, A* Proc. Natl. Acad. Sci. USA, 1984, 81, 6929- 
jj 6933) . Expression is directed by the tac promoter and the 

\% vector incorporates a gene conferring resistance to the 

Ifl antibiotic ampicillin. 

'l\ Two systems were constructed. The first one 

•JE5 expressed the electron transfer proteins putidaredoxin 

reductase {camA gene) and putidaredoxin (camB gene) as a 
P fusion protein with a seven amino acid peptide linker, 

and the P450 cam enzyme (camC gene) was expressed by the 
^ same vector but it was not fused to the electron 
20 proteins. The second system expressed the three proteins 

as separate entities in the E.Coli host. Both systems 

were catalytically competent for substrate oxidation in 

vivo. 

The general strategy was as follows. The genes for 
25 the three proteins were cloned using Eco RI and Hind III 
as flanking sites, with Eco RI at the 5 T end* For both 
in vivo systems there are restriction sites between the 
genes, including between the reductase and redoxin genes 
in the fusion construct. These restriction sites were 
30 introduced by PGR, as detailed below. The first task, 

however, was to carry out a silent mutation to remove the 
Hind III site within the camA gene for the reductase. 
The AAGCTT Hind III recognition sequence in the camA gene 
was changed to AAGCCT, which is a silent mutation because 
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GCT and GCC both encode alanine. The gene was completely 
sequenced to ensure that there were no spurious 
mutations . 

5 1 . The fusion protein system 

2. a Manipulation of the camA gene by PCR 

For the camA gene the primer below was used at the 
5 f end of the gene to introduce the Eco RI cloning site 
and to change the first codon from GTG to the strong 
10 start codon ATG. 

3 5 ' - GAG ATT AAG AAT TCA TAA ACA CAT GGG AGT GCG TGC CAT ATG AAC GCA 

;f AAC 

~j\ Eco RI RBS \-camA 

13 At the 3 T end of camA the primer was designed such 

^ that 15 bases are complementary to nucleotide sequence of 
the last five amino acid residues of camA. The stop 
codon immediately after the GCC codon for the last amino 
acid was removed, and then part of a seven amino acid 
20 linker {Thr Asp Gly Gly Ser Ser Ser) which contained a 
Bam HI cloning site {GGATCC - Gly Ser) was introduced. 
The coding sequence was thus: 

5*- GAA CTG AGT AGT GCC ACT GAC GGA GGA TCC TCA TCG-3* 
25 camA - Thr Asp Gly Gly Ser 

\Bam HI | 

The primer sequence shown below is the reverse 
complement used for PCR: 
30 5 1 - CGA TGA GGA TCC TCC GTC AGT GGC ACT ACT CAG TTC-3 f 

l.Jb Manipulations of the camB gene by PCR 

For the camB gene the primer at the 5 1 end 
incorporated the second half of the peptide linker 
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between the reductase and redoxin proteins, and the 
restriction site Bam HI for joining the two amplified 
genes together. 

5'- TCA TCG GGA TCC TCA TCG ATG TCT AAA GTA GTG TAT- 3 1 

Gly Ser Ser Ser |- camB 
\Bam HI | Start 

At the 3 r end of camB the primer incorporates 12 
nucleotides complementary to the end of camB followed by 
the stop codon TAA, a 6 nucleotide spacer before the GGAG 
ribosome binding site. Xba I and Hind III sites were 
then added to allow cloning of the camC gene when 
required. The sequence of the coding strand was 
therefore : 

5 « - CCC GAT AGG CAA TGG TAA TCA TCG GGAG TCT AGA GCA TCG AAG CTT TCA 
TCG-3 1 

CamB -J stop RBS Xba I Hind III 

The primer shown below is the reverse complement used for 
PCR; 

5 ' -CGA TGA AAG CTT CGA TGC TCT AGA CTCC CGA TGA TTA CCA TTG CCT ATC 
GGG -3 1 

l.c Preparation of the full fusion construct 

The camA and camB genes were amplified by the PCR 
using the primers described above. The new camA was 
digested with Eco RI and Bam HI, while the new CamB was 
digested with Bam HI and Hind III. The pGLWll expression 
vector was digested with Eco RI and Hind III. All three 
were purified by agarose gel electrophoresis and the 
three gel slices containing the separate fragments were 
excised from the gel and ligated together, and then 
transformed into E.Coli DH5cx. Successful ligation of all 
the fragments were confirmed by a series of restriction 
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digestion experiments, especially the presence of the new 
and unique Xba I site. The entire sequence of the insert 
from the Eco RI site to the Hind III site was determined 
to ensure that all the sequences were correct. 

The new plasmid, named pSGB F - was transformed into 
E.Coll and expression of the reductase and redoxin 
proteins was induced by IPTG. When a purified P450 cam 
enzyme was added to the cell-free extract, substrate 
oxidation was observed for a variety of substrates. 

When the camC gene is cloned into the pSGB F plasmid 
using the Xba I and Hind III restriction sites, the new 
recombinant plasmid thus generated expresses the 
reductase and redoxin as a fusion protein and the P450 cam 
enzyme as a operate entity both from the same mRNA 
molecule* This in vivo system is catalytically competent 
for terpene oxidation in whole cells. 



2. The In vivo system with the protein expressed 
separately 

2. a The basic strategy 

The starting point of the preparation of this in 
vivo system was the recombinant plasmid used to express 
the camA gene for putidaredoxin reductase. The camA gene 
was cloned into the pGLWll plasmid using the Eco RI and 
Bam HI restriction sites, with Eco RI being at the 5 1 end 
of the gene. Conveniently the polylinker region of the 
pGLWll vector has a Hind III site downstream of the Bam 
HI site. The camB gene was therefore manipulated by PCR 
such that it can be cloned into pGLWll using the Bam HI 
and Hind III sites. This new plasmid expresses the 
reductase and redoxin as separate proteins. 

The camB gene was cloned into pUC118 by the Bam HI 
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and Hind III cloning sites to express putidaredoxin for 
our general in vitro substrate oxidation work* 
Therefore/ the PCR primer at the 3' end of the camB gene 
was designed to introduce a ribosome binding site and the 
5 Xba I restriction site upstream of the Hind III site so 
that the camC gene can be inserted downstream of camB 
using the Xba I and Hind III sites. Therefore the three 
genes were cloned without fusion in the pGLWll expression 
vector and arranged in the order 5 1 -camA-camB-camC-3 9 , 
10 and each gene has its own RBS to initiate protein 
synthesis. 

^ 2.b .Manipulations of the camB gene 

5 We used the internal and unique restriction site Mlu 

1:5 I (recognition sequence ACGCGT) within the camB gene as 
L, the starting point so that the PCR product has a 
C. different size from the PCR template fragment. The 
\l primers were as follows: 

TO 5 f - TCA TCG ACG CGT CGC GAA CTG CTG-3 1 

where the Mlu I site is in bold. 

The desired coding sequence at the 3 l end of the 
25 camB gene was: 

5'- CCC GAT AGG CAA TGG TAA GTA GGT GAA TAT CTA ATC CCC ATC 
camB -[stop 

TAT GCG CGA GTG GAG TCT AGA GTT CGA-3 1 
30 RBS Xba I 

After the stop codon there is a 35 base spacer 
before the RBS which is used to initiate the synthesis of 
the P4 50 cani enzyme . The Xba I cloning site is located 
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within the spacer between the RBS and the start codon 
(not in this primer) of the camC gene. The PCR primer 
used was the reverse complement of the sequence above. 
The PCR was carried out and the amplified fragment of the 
appropriate size was purified by agarose gel 
electrophoresis and the gel slice excised. 

One extra step was necessary to complete the 
construction of the new plasmid. The plasmid for the 
fusion protein in vivo system was digested with Mlu I and 
Hind III restriction enzymes, purified by agarose gel 
electrophoresis, and the gel slice for the small camB 
fragment excised. The pUC118 plasmid for camB expression 
was similarly digested, and the gel slice for the 
backbone was excised. By ligating the two fragments 
together we prepared a new pUC118-based plasmid which had 
an Xba I site followed by an Hind III site downstream of 
the stop codon of camB. This new plasmid was digested 
with the Mlu I and Xba I enzymes and the backbone was 
ligated with the new camB fragment described above to 
generate a plasmid with the following arrangement of the 
key components: 

..lac Promoter . .Bam EI.. camB grene . . spacer . . RBS . . Xba 
I. .Hind III. . 

2.c Preparation of the in vivo system plasmid 

Once the modified camB with the Xba I and Hind III 
restriction sites and appropriate spacers were prepared, 
the in vivo system was constructed by cloning this into 
the pGLWll-based plasmid used to express the camA gene 
(reductase protein) using the Bam HI and Hind III sites. 
The new in vivo system vector has the following 
arrangement of the key components: 

..tac Promoter. .Eco IRI. . RBS . . camA grene .. spacer Bam 
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HI • .RBS. .camB gene . . spacer . . RBS * . Xba I.. Hind III.. 

This new plasmid, named pSGB\ was transformed into 
E.Coli and expression of the reductase and redoxin 
proteins was induced by IPTG. When a purified P450 cam 
enzyme was added to the cell-free extract, substrate 
oxidation was observed for a variety of substrates. 

When the camC gene is cloned into this pSGC + plasmid 
using the Xba I and Hind III restriction sites, the new 
recombinant plasmid thus generated will express the three 
proteins separately, each under the direction of its own 
RBS but from the same mRNA molecule. Thus constitutes 
the in vivo system used in the vast majority of our 
terpene oxidation work. 

3. Introduction of an Xba I site into pRH1091 

This is the final step to enable the camC gene to be 
cloned into the in vivo systems by the two cloning sites 
Xba I and Hind III. The Xba I site was added by PCR of 
the entire pRH1091 plasmid using two primers. The 
presence of these two sites will also enable cloning of 
the camC gene into Ml 3 since both Xba I and Hind III are 
unique in camC and M13. 

The primers shown below maintain the Hind III 
cloning site AAGCTT: 

5 '-TCA TCG AAG CTT GGC TGT TTT-3 1 
Hind III |- vector 

At the other end the coding sequence desired was: 

5 T -ACA ATT TCA CAC AGGA TCT AGA C CAT ATG TCA TCG AAG CTT TCA 
TCG- 3 1 

Vector -| RBS Xba I Nde I Hind III 
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This sequence maintained the Nde I and Hind III sites but 
the new Xba I site was introduced upstream of the Nde I 
site. The PCR primer used was the reverse complement of 
the desired sequence: 

5 

5 T -CGA TGA AAG CTT CGA TGA CAT ATG GTC T AGA TCCT GTG TGA AAT 
TGT-3 1 

The PCR product was then purified by agarose gel 
10 electrophoresis, digested with Hind III and circularised 
□ with T4 DNA ligase. Success of the PCR method was 
.rj indicated by the presence of a new and unique Xba I site 
y! in plasmid DNA isolated from transf ormants . 

JU5 4 . Cloning of camC into the In vivo systems 

f" All existing camC mutants were cut out of pRH1091- 

^ based expression plastids with Nde I and Hind III. The 
_I new vector is similarly cut with the same restriction 
= ; enzymes and the camC gene cloned into this plasmid with 
SjD T4 DNA ligase. This DNA is transformed into E.Coli 0M109 
which then may be grown to express P4 50 cam . 

The camC gene is excised from the new vector using 
Xba I and Hind III restriction enzymes and cloned into 
either the in vivo vector systems or M13mpl9 for 
25 mutagenesis. 

5 . In vivo expression and substrate turnover 

For protein expression, cells are grown in LBamp 
medium (tryptone 10 g/litre, yeast extract 5 g/litre, 
30 NaCl 10 g/litre, 50 \iq/ml ampicillin) at 30°C until the 
OD 600nm reaches 1.0 - 1.2. IPTG (isopropyl~3-D- 
thiogalactopyranoside) was added to a final concentration 
of 1 pM (from a 1 M stock in H 2 0) and the culture was 
incubated at 30°C overnight. 



SUBSTITUTE SHEET (RULE 26) 



€ 

WO 00/31273 PCT/GB99/03873 

-36- 

For simple screening the substrate can be added to 
culture and the incubation continued. However, due to 
impurities from the culture media the cells were 
generally washed twice with 0.5 vol. of buffer P, (KH 2 P0 4 
6.4 g, K 2 HP0 4 .3H 2 0 25.8 g, H 2 0 to 4 litres, pH 7.4) and 
resuspended in 0.25 vol. oxygen saturated buffer P 
containing 24 rnM glucose. Substrate was added to 1 mM and 
the incubation continued at 30 °C. The reaction was 
allowed to run for 24 hours with periodic additions of 
substrate and glucose. 

The reaction was analysed by extracting 1 ml of the 
incubation mixture with 250 pi of ethylacetate . After 
centrifuging in a microcentrifuge at 13, 000 g for 2 
minutes, 2 pi of the organic extract was injected onto a 
0.25 mm x 30 m DB-1 gas chromatography column in a Fisons 
GC 8000 series gas chromatograph. The samples were 
carried through the column using helium carrier gas and 
the compounds present were detected using a flame 
ionisation detector. 

A variety of temperature programmes were used for 
different substrate to resolve turnover products. 



Monoterpenes 

Injector temperature 
Detector temperature 
Oven temperature 



Sesqui terpenes 

Injector temperature 
Detector temperature 
Oven temperature 



150 °C 
250 °C 

120 °C for 15 min - 200 °C 
at 25 °C/min, 200 °C for 1 
min. 

250 °C 
250 °C 

150 °C - 230 °C at 5 °C/min, 
230 °C for 1 min. 



Results for particular oxidation reactions are shown 
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in table 6 and in the Figures . 

Figure 1 shows the result of an oxidation reaction 
with camphor. The 5-ketocamphor arises from further 
oxidation of 5-exo-hydroxycamphor . As can be seen there 
is surprisingly little of the further oxidation occurring 
in the presence of camphor. 

Example 4 

A second In vivo expression system 

The cluster of genes for the expression of the three 
proteins of the P450 cam system, as described in Example 3, 
were also expressed in whole E. coli cells using the pCW 
vector. This vector utilises two tac promoters arranged 
in line to increase protein expression. It has a RBS, 
and contains the gene conferring resistance to the 
antibiotic ampicillin (Barnes, H.J. Methods Enzymol. 
1996, 272, 3-14} . 

Both methods of expressing catalytically competent 
P450 cam systems described in Example 3 were successful 
with the pCW vector. Thus the fusion system, where 
putidaredoxin reductase and putidaredoxin were expressed 
as a fusion protein with an oligopeptide linker, but the 
P450 cam monooxygenase was expressed but not fused to the 
electron transfer proteins. The second system expressed 
all three proteins as separate entities in the same E. 
coli host. 

1. The fusion protein system 

A new plasmid was constructed by cloning the gene 
for the fusion of electron transfer proteins into pCW so 
that different P450 cam mutants could be introduced into 
the system by cloning. The 5 T end oligonucleotide used 
for the PCR amplification of the cam A gene introduced 
not only the Eco RI site for cloning into the pGL Wll 
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vector but also a Nde I site which spans the ATG start 
codon of the gene (see Example 3, section la) . In the 
pCW vector there is a Nde I site positioned downstream of 
the RBS for cloning of the gene to be expressed- The pGL 
5 Wll vector containing the camA-camB fusion gene was 

digested with Nde I and Hind III, and the insert purified 
by agarose gel electrophoresis. The pCW vector system 
was also digested with these two enzymes and the 
linearised vector purified by the same method. The two 
10 fragments were ligated with DNA ligase to generate the 
r :.^ new pCWSGB F plasmid based on the pCW vector and which 
:.i expressed the fusion of the electron transfer proteins. 
;« The insert excised from the pGL Wll-based plasmid already 
U contained a RBS for protein expression and an Xba I site 
1-S just upstream of the Hind III site (see Example 3, 

section 2.c), so that the cam C gene encoding P450 cam 
mutants can be cloned using these two sites and expressed 
^ off the RBS. Therefore genes encoding the P450 cain mutants 
r] can be excised from the modified pRH system and cloned 
£0 into the new pCWSGB F plasmid using the Xba I and Hind III 
sites. 

2* The three proteins expressed separately 

This system was generated in exactly the same way as 
25 for the fusion system. Thus the pSGB+ plasmid was 

digested with the Nde I and Hind III restriction enzymes 
and the insert cloned into the pCW vector. This new 
plasmid pCWSGB-f expressed putidaredoxin reductase and 
putidaredoxin as separate entities off the twin tac 
30 promoters of the pCW vector. The P4 5 0 cam mutants were 

introduced into this vector using the Xba I and Hind III 
sites . 

3. In vivo expression and substrate turnover 
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The conditions described below were used for test 
purposes in shake flasks in a laboratory and were not 
optimised* Under proper fermenter conditions with higher 
expression and biomass the final yield of products will 
5 be much increased. 

E. coll DH5a cells harbouring either of the 
catalytically competent P450 cam systems were grown from a 
single colony on an agar plate in 1 L of LBamp medium at 
30 °C until OD 600nm reached ca. 1. IPTG was added to 1 mM 
10 final concentration {from a 1 M stock in water) and the 
\* culture grown for a further 6 h. The final OD 600nm were in 

g the range 2.0 - 2*5. Cells were harvested by 

^ centrifugation at 5000 g and washed once with 40 mM 

I.-, phosphate buffer, pH 7.4* The cell pellet was 

";;15 resuspended in 500 ml of 4 0 mM phosphate buffer, pH 7.4, 

Glucose was added as a 2 M stock to a final concentration 
Tl of 100 mM, and 1 mL of the substrate (cx-pinene or R- 

■r^ limonene) was added to start the reaction. The mixture 

•4 was shaken in an open 2 L conical flask in an orbital 

H20 incubator at 200 rpm. More glucose was added every 24 h 
(100 mM final concentration based on a 500 mL volume, 
from a 2 M stock) and more substrate (1 mL) was added 
every 12 h. The progress of the reaction was monitored 
by GC and the whole cell system was active for at least 5 
25 days at ambient temperatures and the minimum yield at the 
end of day 5, as assayed by extraction of the reaction 
medium with chloroform and analysing by GC, was 100 mg of 
products without accounting for volatilisation of 
products into the atmosphere and condensation on the 
30 flask above the liquid level. In addition, no compounds 
arising from further hydroxylation at another carbon atom 
were observed when the substrate was present. 



SUBSTITUTE SHEET (RULE 26) 



WO 00/31273 PCT/GB99/03873 

-40- 

Example 5 

In vitro and In vivo substrate oxidation by P450 WM . , 

In a typical reaction in vitro (optional 30°C 
temperature control) the 1.5 mL incubation mixture 
contained 40 mM phosphate buffer, pH 8.0, 1 pM P450 BM _ 3 , 
50 pg/mL of catalase, and terpene substrates were added 
as a 1 M stock in ethanol to a final concentration of 2 
mM. NADPH was added typically to 400 ]M final 
concentration, and the rates of reaction could be 
monitored at 340 iua. After all the NADPH had been 
consumed, the mixture was extracted by vortexing with 0.5 
mL of chloroform, the phases separated by centrif ugation, 
and the organic phase could be analysed by GC using the 
programs described in Example 3 - 

Catalase does not catalyse the substrate oxidation 
reaction but rather it is present to remove any hydrogen 
by-product which could otherwise denature the enzyme. The 
method can be increased in scale to, for example, 20 mL 
total incubation volume to allow purification of 
sufficient produces by HPLC for spectroscopic 
characterisation. Fresh substrate (1 mM) and NADPH (1-2 
mM) are added periodically, such as every 20 minutes in a 
total reaction time of, typically 3 hours. 

Since the P450 BM _ 3 enzyme is catalytically self- 
sufficient, i.e., both the monooxygenase and electron 
transfer domains are in a single polypeptide, the enzyme 
as expressed in E. coli can be used for whole cell, in 
vivo substrate oxidations. The procedure described under 
Example 3 for in vivo substrate oxidation by the P450 cam 
enzyme can also be used for the P450 BM _ 3 enzyme. 
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orimene 



Table 1 



SUBSTITUTE SHEET (RULE 26) 



WO 00/31273 



-42- 



PCT/GB99/03873 



H 2 C 



CH S 



H 



■CH a 



CH 3 
(+)-Aromadendrene 



CH 3 H 



H 3 C 




H 

H 2 C 
B-Caryophyllene 

(-)-*nms-Caryophyllene 





<+)-VaIencene 



CH 3 




Isobazzanene 



CH 3 - 




H 3 C 

Silphinene 





Ishwarane 



Isopatchchoul-3-ene 




Isosesquicarene 



Table 2 
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Table 3. HYDROPATHY SCALE FOR AMINO ACID 
SIDE CHAINS 



Side Chain Hydropathy 



He 


4.5 


Val 


4.2 


Leu 


3.8 


Phe 


2.8 


Cys 


2.5 


Met 


1.9 


Ala 


1.8 


Gly 


-0.4 


Thr 


-0.7 


Ser 


-0.8 


Trp 


-0.9 


Tyr 


-1.3 


Pro 


-1.6 


His 


-3.2 


Glu 


-3.5 


Gin 


-3.5 


Asp 


-3.5 


Asn 


-3.5 


Lys 


-3.9 


Arg 


-4.5 
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Table 4: P45CL^mutants 



Ail mutants optionally contain the base mutation C334A. 



Single mutants: Y96A, Y96F, Y96L, Y96W 




Double mutants: 








Y96A-F87A 


Y96F-F87A 


Y9 65*— V!? O 


Y96L-F87A 


Y96A-F87L 


Y96F-F87I 




Y96L-F87L 


Y96A-F87W 


Y96F-F87Ii 




Y96L-F98W 


Y96A-F98W 


Y96F— F87W 


X ror- Ji2 ybXi 


Y96L-T101L 




l9oF- F98W 


Y96F-A296F 


Y96L-T101F 


Y96A-V247A 


Y96F-T1Q1L 


Y96F-I395F 


Y96L-L244A 


Y96A-V247L 


Y96F-T101F 


Y96F-I395G 


Y96L-I.244F 


Y96A-I395F 


Y96F-T185A 


Y96F-V396A 


Y96X.-V247A 


Y96A-I395G 


Y96F-T185F 


Y96F-V396L 


Y96L-V247L 




Y96F-T185I* 


Y96F-V396F 


Y96I.-V247F 




Y96F-L244A 


Y96F-V396W 


Y96L-V247W 




Y96F-V247A 




Y96L-G248L 




Y96F-V247I* 




Y96L-V295L 




Y96F-G248L 




Y96X.-V295F 



Y96L-A296L 
Y96L-A296F 
Y963>V396A 
Y96L-V396L 
Y96L-V396F 
Y96L-V396W 



Y96W-F87W 

Y96W-F98W 

Y96W-L244A 

Y96W-V247A 

Y96W-V396A 



Trxple Mutants : 
Y96A-F87A-L244A 
Y96A-F87A-V247A 
Y96A-F87L-L244A 
Y96A-F87L-V247A 
Y9 6A~ 1*2 4 4A- V2 4 7 A 

Y96L-F87A-L244A 

Y96L-F87A-V247A 

Y96L-F87I.-L244A 

Y96I.-F87L-V247A 

Y96L-V247A-I395F 

Y96L-V247L-I395F 

Y961-V247L-I395G 

Y96L-L2 44A-V3 9 6L 

i96L-L244A-V396F 

Y96L-L244A-V396W 

Y96L-L2 44F-V3 9 6A 



Y96L-V247A-V396I. 
Y96L-V247A-V396F 
Y96L-V247A-V396W 
Y96L-V247F-V396A 

Y96F-F87A-L244A 
Y96F-F87A-V247A 
Y96F-F87A-V247I, 
Y96F-F87A-I395F 
Y96F-F87A-I395G 
Y96F-F873>V247A 
Y96F-F87l i -V247I t 
Y96F-F87I.-I395F 
Y96F-F87W-T185A 
Y96F-F87W-T185F 
Y96F-F8 7W-T1 8 5L 
Y96F-F87W-L244F 



Y96F-F87W-V247A 

Y96F-F87W-V247I, 

Y96F-F87W-V247F 

Y96F-F87W-V295L 

Y96F-F87W-A296L 

Y9SF-F87W-V396A 

Y96F-F87W-V396L 

Y96F-V247F-V396A 

Y96F-L244A-V396I. 

Y96F-L244A-V396F 

Y9 6F-I244A-V396W 

Y96F-L244F-V396A 

Y96F-V2 47A-V3 9 6L, 

Y96F-V247A-V396F 

Y9 6F-V247A- V3 9 6W 

Y96W-F87W-F98W 



SUBSTITUTE SHEET (RULE 26) 



WO 00/31273 



45* 



PCT/GB99/03873 



c 
o 
u 



CC 



! 

$ 
1 

> 

fa 



ni 
in 

i 

hI 
r- 



e 

! 

Hi 



p p 



in 
oo 

rH 
EH 

A 

s 

1 

fa 

vo 

X 



h3 
in 
oo 

Eh 

J 



fa 
I 

fa 
<0 

o> 

X 



Hi 

vo 

e 

1 

H* 

? 

Hi 

m 

CO 
rH 
EH 

{2 

r* 

CO 

fa 

j 

fa 

vo 

G\ 

X 



§ 

•H 
■P 

id 
-p 

W 



P 

1 



CM 

I 

00 

fa 

A 

VO 

cn 

X 



Hi 

- I 

hJ 
r- 
oo 
fa 

* 

x 



CM 

h? 
I 

a 

CO 
fa 

Hi 
VO 

X 



p 



CM 
Hi 

I 

Hi 

co 
fa 

i 

Hi 

VO 

av 



Hi 

m 

? 

CM 

Hi 

00 

fa 
vo 



VO 

? 

fa 

CM 
Hi 

oo 
fa 

i 

fa 
vo 



H? 

! 

CM 

Hi 

r* 

co 
fa 

i 

fa 
vo 

C\ 

X 



h! 
vo 
a\ 

s 

p 

oo 
fa 

i 

fa 
vo 
c\ 



fa 

vo 
cv 

CO 

r* 

I 

00 

fa 

1 

fa 

vo 

X 



tn 

p 

i 

Hi 

co 
fa 
i 

fa 
vo 
a* 
X 



5 

p 

I 

Hi 

oo 
fa 

i 

fa 

VO 
C\ 

X 



vo 

2 

I 

fa 

oo 
fa 
i 

fa 

VO 

0* 

X 



fa 

in 
a* 
en 

H 

A 

p 

oo 
fa 

i 

fa 
vo 

X 



in 
a\ 
co 

H 

p 

!> 

00 
fa 
1 

fa 
VO 

a\ 
X 



SUBSTITUTE SHEET (RULE 26) 



WO 00/31273 PCT/GB99/03873 

-46- 

>— i i— * 
1 1 
fX, 

m in 

! I 



> J ^ ^ < 

^ ^ ^ J 

VO \0 ^ ^ 

M N 2 2 

< < < 2 s 

JXi {JLt tL, fcL, 

in in in m 

I I I I I 

^ _J »-J 

i> 

TT t 

cc! o4 pi 



in m 



I 

2 



i 

2 



a 



<; g 

m m m m 



I I I I I I 

fin (X, C3U Uh 

in in m in in in 

>* > >M >< 



I 

2 



I 

2 



1 



i 



< ^ 

CN CN £2 
^ ^ 7? 

4- 

Uh »3 



>o «r> «p 
5* > > 

Dei pgj 



«n £ 



in 
l 



< 



2 



2 



^t* ^" 

vo vo 

(N N N 

< < ^ 

t I i 

£££ 

o 

oo oo oo 

Ph b 

I E i 
U. U, 

m in m 

^* ^ 

1 I t 

^5 ^ ^5 

OsS P< 



s 

ca 
O 
ID 

in 

H 



U< < J 

^ in m 

^ ^ > 
l f 
< < 



i 



o£ c4 c4 



SUBSTITUTE SHEET (RULE 26) 



WO 00/31273 



PCT/GB99/03873 



■47- 



CO 

•§ 

8 
a, 

^ 

o 
tM 
o 

CO 

a 

o 

tE 

o 

& 



s 

2 
a, 

< 



I d 
o « 

^ oo 

^ a> 
*g 

-° . 

d co" 

.2 -§ 

ts 

.13 o 
o ° 

O CO 

o o 

^ ° 

>> "3 

> a? 
o 

ss S 

§ g 

x: ° 

°.f 

CO 

£ 2 
3 s 
.2 

NO 

* s 

«Q — 

H S 



O 

a 



CO 



O 



CD 
*-* 

2 e 

£ cd 

CCS o 

u 

.S T3 

O co 

E CQ 

CO 1 — ' 
c/f E 

t3 o 



CO 
4> 



"3 

B 

CO 

b 

<L> 
> 



& t 

CO CO 

O O 

CO CO 

a* &♦ 

£ g 

GO CO 

o o 



E 

od 
i 



o 

no 
P 



\° \° 

0 s o x 0 s - 

O m CN 

oo V V 



O 
oo 

A 



O 

I 



<N in 



V7 " 



o 

vd 
r 

o 

VO 



S 
o 

vd 
i 

o 

vd 



- r— < 

o 

CO 



cN 



N° V° N° 
V« ^ ^ vO 

>^ 0 s * ^ o in ©\ 
r tn w a os <n 

^ AAA 



1 s 



0 s * 



m 

i 

m 

CM 



O 
i 



\P s« s« 
0 s - 0 s - 0 s - 

oo ^ in 



7 ^ 



CO CO 

2 S 

a, a, 

O O 

xO nO 

0 s * 6 s - 

O O 

m (N 

OX) GJD 

c c 

"s 's 

s s 

H ii 



CO 
■*-"* 

CO O 

I 2 
2 

Cu )-i 

S « >> 

> 



c 
o 
a 



N? ^ VP V« 
0 s C X 0 s - o x 

o o tr> 



o 

S 

o o o o 
Jj m m 

> 



O nP n» no 
0 s 6 s - 0 s - 
g O O O 



CO 
<D 

CU 



s 

CO 



C 

On O 



^ a 



A ^ 



o 

2 



m i 



0 s - 

O O 

V X 



2 



o 

OS 

•IS 



§ e 



6 s * 6 s - 
O O 



o x 0 s 
m O 



s 

O sp 

m 0 s - 
, ♦ tn 
oo 

o 

CN 

vd 



0 s - 6 s - 
O m 



o 

00 



0 s - 



o 

VO 



t 

o 



G 



vO vo 

ON ON 



-J 

> 

»— 3 (jr. 

oo oo 



1-4 

> 
t 

EX 



< 

S >:> 

S ' - 

jr 2 vo vo 

7 '-T C\ On 

s ^ >• 



—1 
> 

I 

VO 
o\ 

oo 



S 

o 
c 

"3 



< 

O CN CM 
S> > 

-2 vo vo 

IT" ON CV 

^ > >- 



< 



s 
c 

r— on 



SUBSTITUTE SHEET (RULE 26) 



WO 00/31273 



-48- 

Table 7: P450cam wild-type sequence 
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acq act gaa acc ata caa 
Thr Thr Glu Thr He Gin 
1 5 
cat gtg cca gag cac ctg 
His Val Pro Glu His Leu 
20 

aat ctg tct gcc ggc gtg 
Asn Leu Ser Ala Gly Val 
35 

aac gta ccg gat ctg gtg 
Asn Val Pro Asp Leu Val 
50 

gcc act cgc ggc caa ctg 
Ala Thr Arg Gly Gin Leu 
65 70 
ttt tec age gag tgc ccg 
Phe Ser Ser Glu Cys Pro 
85 

gac ttc att ccc acc teg 
Asp Phe lie Pro Thr Ser 
100 

gcg ctg gcc aac caa gtg 
Ala Leu Ala Asn Gin Val 
115 

aac egg ate cag gag ctg 
Asn Arg He Gin Glu Leu 
130 

caa gga cag tgc aac ttc 
Gin Gly Gin Cys Asn Phe 
145 150 



age aac gcc aat ctt 
Ser Asn Ala Asn Leu 
10 

gta ttc gac ttc gac 
Val Phe Asp Phe Asp 
25 

cag gag gcc tgg gca 
Gin Glu Ala Trp Ala 
40 

tgg act cgc tgc aac 
Trp Thr Arg Cys Asn 
55 

ate cgt gag gcc tat 
lie Arg Glu Ala Tyr 
75 

ttc ate cct cgt gaa 
Phe He Pro Arg Glu 
90 

atg gat ccg ccc gag 
Met Asp Pro Pro Glu 
105 

gtt ggc atg ccg gtg 
Val Gly Met Pro Val 
120 

gcc tgc teg ctg ate 
Ala Cys Ser Leu He 
135 

acc gag gac tac gcc 
Thr Glu Asp Tyr Ala 
155 



gcc cct ctg cca ccc 48 
Ala Pro Leu Pro Pro 
15 

atg tac aat ccg teg 96 
Met Tyr Asn Pro Ser 
30 

gtt ctg caa gaa tea 144 
Val Leu Gin Glu Ser 
45 

ggc gga cac tgg ate 192 

Gly Gly His Trp He 

60 

gaa gat tac cgc cac 240 
Glu Asp Tyr Arg His 
80 

gcc ggc gaa gcc tac 288 
Ala Gly Glu Ala Tyr 
95 

cag cgc cag ttt cgt 336 
Gin Arg Gin Phe Arg 
110 

gtg gat aag ctg gag 384 
Val Asp Lys Leu Glu 
125 

gag age ctg cgc ccg 432 
Glu Ser Leu Arg Pro 
140 

gaa ccc ttc ccg ata 480 
Glu Pro Phe Pro He 
160 
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cgc ate ttc atg 
Arg lie Phe Met 

ttg aaa tac eta 
Leu Lys Tyr Leu 
180 

ttc gca gag gec 
Phe Ala Glu Ala 
195 

gag caa cgc agg 
Glu Gin Arg Arg 
210 

aac ggc cag gtc 
Asn Gly Gin Val 
225 

atg tgt ggc ctg 
Met Cys Gly Leu 

etc age ttc age 
Leu Ser Phe Ser 
260 

gag ctg ate gag 
Glu Leu lie Glu 
275 

etc egg cgc ttc 
Leu Arg Arg Phe 
290 

tac gag ttt cat 
Tyr Glu Phe His 
305 



-49 

ctg etc gca ggt eta 
Leu Leu Ala Gly Leu 
165 

acg gat cag atg acc 
Thr Asp Gin Met Thr 
185 

aag gag gcg etc tac 
Lys Glu Ala Leu Tyr 
200 

cag aag ccg gga acc 
Gin Lys Pro Gly Thr 
215 

aat ggg cga ccg ate 
Asn Gly Arg Pro lie 
230 

tta ctg gtc ggc ggc 
Leu Leu Val Gly Gly 
245 

atg gag ttc ctg gec 
Met Glu Phe Leu Ala 
265 

cgt ccc gag cgt att 
Arg Pro Glu Arg lie 
280 

teg ctg gtt gec gat 
Ser Leu Val Ala Asp 
295 

ggc gtg caa ctg aag 
Gly Val Gin Leu Lys 
310 



ccg gaa gaa gat 
Pro Glu Glu Asp 
170 

cgt ccg gat ggc 
Arg Pro Asp Gly 

gac tat ctg ata 
Asp Tyr Leu lie 
205 

gac get ate age 
Asp Ala lie Ser 
220 

acc agt gac gaa 
Thr Ser Asp Glu 
235 

ctg gat acg gtg 
Leu Asp Thr Val 
250 

aaa age ccg gag 
Lys Ser Pro Glu 

cca gee get tgc 
Pro Ala Ala Cys 
285 

ggc cgc ate etc 
Gly Arg lie Leu 
300 

aaa ggt gac cag 
Lys Gly Asp Gin 
315 



ate ccg cac 528 
lie Pro His 
175 

age atg acc 576 

Ser Met Thr 

190 

ccg ate ate 624 
Pro lie lie 

ate gtt gec 672 
lie Val Ala 

gee aag agg 720 
Ala Lys Arg 
240 

gtc aat ttc 7 68 
Val Asn Phe 
255 

cat cgc cag 816 

His Arg Gin 

270 

gag gaa eta 864 
Glu Glu Leu 

acc tec gat 912 
Thr Ser Asp 

ate ctg eta 960 
lie Leu Leu 
320 
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ccg cag atg ctg tct 
Pro Gin Met Leu Ser 
325 

cac gtc gac ttc agt 
His Val Asp Phe Ser 
340 

ggc age cat ctg tgc 
Gly Ser His Leu Cys 
355 

gtc acc etc aag gaa 
Val Thr Leu Lys Glu 
370 

ccg ggt gec cag att 
Pro Gly Ala Gin lie 
385 

gca etc cct ctg gtc 
Ala Leu Pro Leu Val 
405 



-50- 

ggc ctg gat gag cgc 
Gly Leu Asp Glu Arg 
330 

cgc caa aag gtt tea 
Arg Glu Lys Val Ser 
345 

ctt ggc cag cac ctg 
Leu Gly Gin His Leu 
360 

tgg ctg acc agg att 
Trp Leu Thr Arg lie 
375 

cag cac aag age ggc 
Gin His Lys Ser Gly 
390 

tgg gat ccg gcg act 
Trp Asp Pro Ala Thr 
410 



gaa aac gec tgc 
Glu Asn Ala Cys 

cac acc acc ttt 
His Thr Thr Phe 
350 

gec cgc egg gaa 
Ala Arg Arg Glu 
365 

cct gac ttc tec 
Pro Asp Phe Ser 
380 

ate gtc age ggc 
He Val Ser Gly 
395 

acc aaa gcg gta 
Thr Lys Ala Val 
414 



PCT/GB99/03873 

ccg atg 1008 

Pro Met 

335 

ggc cac 1056 
Gly His 

ate ate 1104 
He He 

att gec 1152 
He Ala 

gtg cag 1200 
Val Gin 
400 

1242 
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Table 8: P450 BM . 3 sequence 



1/1 

atg aca att aaa gaa atg cct 
Met Thr lie Lys Glu Met Pro 
61 / 21 

tta aac aca gat aaa ccg gtt 
Leu Asn Thr Asp Lys Pro Val 
121 /41 

ttt aaa ttc gag gcg cct ggt 
Phe Lys Phe Glu Ala Pro Gly 
181 / 61 

gaa gca tgc gat gaa tea cgc 
Glu Ala Cys Asp Glu Ser Arg 
241 / 81 

gat ttt gca gga gac ggg tta 
Asp Phe Ala Gly Asp Gly Leu 
301 / 101 

cat aat ate tta ctt cca age 
His Asn He Leu Leu Pro Ser 
361 / 121 

gtc gat ate gec gtg cag ctt 
Val Asp lie Ala Val Gin Leu 
421 / 141 

gaa gta ccg gaa gac atg aca 
Glu Val Pro Glu Asp Met Thr 
481 / 161 

tat cgc ttt aac age ttt tac 
Tyr Arg Phe Asn Ser Phe Tyr 
541 / 181 

gca ctg gat gaa gca atg aac 
Ala Leu Asp Glu Ala Met Asn 
601 / 201 

gaa aac aag cgc eag ttt caa 
Glu Asn Lys Arg Gin Phe Gin 
661 / 221 

att gca gat cgc aaa gca age 
lie Ala Asp Arg Lys Ala Ser 
721 / 241 

gga aaa gat cca gaa acg ggt 
Gly Lys Asp Pro Glu Thr Gly 
781 / 261 

aca ttc tta att gcg gga cac 
Thr Phe Leu lie Ala Gly His 
841 / 281 

tta gtg aaa aat cca cat gta 
Le£ Val Lys Asn Pro His Val 
S01 / 301 

gat cct get cca age tac aaa 
Asp Pro Val Pro Ser Tyr Lys 
361 / 321 

gaa gcg ctg cgc tta tgg cca 
Glu Ala Leu Arg Leu Trp Pro 
1021 / 341 

gtg ctt gga gga gaa tat cct 
Val Leu Gly Gly Glu Tyr Pro 
1081 / 361 

ctt cac cgt gat aaa aca att 
Leu His Arg Asp Lys Thr He 
1141 / 381 

gaa aat cca agt gcg att ccg 
Glu Asn Pro Ser Ala He Pro 
1201 / 401 

tgt ate ggt cag cag ttc get 
Cys He Gly Gin Gin Phe Ala 
1261 / 421 

cac ttt gac ttt gaa gat cat 
His Phe Asp Phe Glu Asp His 
1321 / 441 

aaa cct gaa ggc ttt gtg gta 
Lys Pro Glu Gly Phe Val Val 
1381 / 461 



31 / 
cag cca aaa acg ttt gga 
Gin Pro Lys Thr Phe Gly 

91 / 
caa get ttg atg aaa att 
Gin Ala Leu Met Lys He 

151 / 
cgt gta acg cgc tac tta 
Arg Val Thr Arg Tyr Leu 

211 / 
ttt gat aaa aac tta agt 
Phe Asp Lys Asn Leu Ser 

271 / 
ttt aca age tgg acg cat 
Phe Thr Ser Trp Thr His 

331 / 
ttc agt eag cag gca atg 
Phe Ser Gin Gin Ala Met 

391 / 
gtt caa aag tgg gag cgt 
Val Gin Lys Trp Glu Arg 

451 / 
cgt tta acg ctt gat aca 
Arg Leu Thr Leu Asp Thr 

511 / 
cga gat cag cct cat cca 
Arg Asp Gin Pro His Pro 

571 / 
aag ctg cag cga gca aat 
Lys Leu Gin Arg Ala Asn 

631 / 
gaa gat ate aag gtg atg 
Glu Asp He Lys Val Met 

691 / 
ggt gaa caa age gat gat 
Gly Glu Gin Ser Asp Asp 

751 / 
gag ccg ctt gat gac gag 
Glu Pro Leu Asp Asp Glu 

811 / 
gaa aca aca agt ggt ctt 
Glu Thr Thr Ser Gly Leu 

871 / 
tta caa aaa gca gca gaa 
Leu Gin Lys Ala Ala Glu 

931 / 
caa gtc aaa cag ctt aaa 
Gin Val Lys Gin Leu Lys 

991 / 
act get cct gcg ttt tec 
Thr Ala Pro Ala Phe Ser 

1051 / 
tta gaa aaa ggc gac gaa 
Leu Glu Lys Gly Asp Glu 

1111 / 
tgg gga gac gat gtg gaa 
Trp Gly Asp Asp Val Glu 

1171 / 
cag cat gcg ttt aaa ccg 
Gin His Ala Phe Lys Pro 

1231 / 
ctt cat gaa gca acg ctg 
Leu His Glu Ala Thr Leu 

1291 / 
aca aac tac gag ctg gat 
Thr Asn Tyr Glu Leu Asp 

1351 / 
aaa gca aaa teg aaa aaa 
Lys Ala Lys Ser Lys Lys 
1411 / 



11 

gag ctt aaa aat tta ccg tta 
Glu Leu Lys Asn Leu Pro Leu 
31 

gcg gat gaa tta gga gaa ate 
Ala Asp Glu Leu Gly Glu He 
51 

tea agt cag cgt eta att aaa 
Ser Ser Gin Arg Leu He Lys 
71 

caa gcg ctt aaa ttt gta cgt 
Gin Ala Leu Lys Phe Val Ara 
91 

gaa aaa aat tgg aaa aaa gcg 
Glu Lys Asn Trp Lys Lys Ala 
111 

aaa ggc tat cat gcg atg atg 
Lys Gly Tyr His Ala Met Met 
131 

eta aat gca gat gag cat att 
Leu Asn Ala Asp Glu His He 
151 

att ggt ctt tgc ggc ttt aac 
He Gly Leu Cys Gly Phe Asn 
171 

ttt att aca agt atg gtc cgt 
Phe He Thr Ser Met Val Arg 
191 

cca gac gac cca get tat gat 
Pro Asp Asp Pro Ala Tyr Asp 
211 

aac gac eta gta gat aaa att 
Asn Asp Leu Val Asp Lys He 
231 

tta tta acg cat atg eta aac 
Leu leu Thr His Met Leu Asn 
251 

aac att cgc tat caa att att 
Asn He Arg Tyr Gin He He 
271 

tta tea ttt gcg ctg tat ttc 
Leu Ser Phe Ala leu Tyr Phe 
291 

gaa gca gca cga gtt eta gta 
Glu Ala Ala Arg Val Leu Val 
311 

tat gtc ggc atg gtc tta aac 
Tyr Val Gly Met Val Leu Asn 
331 

eta tat gca aaa gaa gat acg 
Leu Tyr Ala Lys Glu Asp Thr 
351 

eta atg gtt ctg att cct cag 
Leu Met Val Leu He Pro Gin 
371 

gag ttc cgt cca gag cgt ttt 
Glu Phe Arg Pro Glu Arg Phe 
391 

ttt gga aac ggt cag cgt gcg 
Phe Gly Asn Gly Gin Arg Ala 
411 

gta ctt ggt atg atg eta aaa 
Val Leu Gly Met Met Leu Lys 
431 

att aaa gaa act tta acg tta 
He Lys Glu Thr Leu Thr Leu 
451 

att ccg ctt ggc ggt att cct 
He Pro Leu Gly Gly He Pro 
471 
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cat gaa gac tat ctg tat caa 

Hi a Glu Asp Tyr Leu Tyr Gin 

2881 / 961 

ctt cat acc get ttt tct cgc 

Leu His Thr Ala Phe Ser Arg 

2941 / 581 

gaa caa gac ggc aag aaa ttg 

Glu Gin Asp Gly Lys Lys Leu 

3001 / 1001 

gga gac gga age caa atg gca 

Gly Asp Gly Ser Gin Met Ala 

3061 / 1021 

gtt cac caa gtg agt gaa gca 

Val His Gin Val Ser Glu Ala 

3121 / 1041 

cga tac gca aaa gac gtg tgg 

Arg Tyr Ala Lys Asp Val Trp 



-53- 



gaa 


gag 


ctt 


gaa aac 


gec 


bill 


Glu 


Leu 


Glu Asn 


Ala 








2511 


/ 


atg 


cca 


aat 


cag ccg 


aaa 


fieti 


Pro 


Asn 


Gin Pro 


Lys 








2971 


/ 


att 


gaa 


ctt 


ctt gat 


caa 


iJ.e 


V?XU 


Leu 


Leu Asp 


Gin 










/ 


ect 


gec 


gtt 


gaa gca 


acg 


Pro 


Ala 


Val 


Glu Ala 


Tnr 








3091 


/ 


gac 


get 


cgc 


tta tgg 


ctg 


Asp 


Ala 


Arg 


Leu Trp 


Leu 


get 


ggg 


taa 






Ala 


Gly 


OCH 







caa age 


gaa 


ggc 


ate 


att 


acg 


Gin Ser 


Glu 


Gly 


He 


lie 


Thr 


571 












aca tac 


gtt 


cag 


cac 


gta 


atg 


Tnr Tyr 


Val 


Gin 


His 


Val 


Met 


551 












gga gcg 


cac 


ttc 


tat 


att 


tgc 


Gly Ala 


His 


Phe 


Tyr 


He 


Cys 


1011 










ctt atg 


aaa 


age 


tat 


get 


gac 


Leu Met 


Lys 


Ser 


Tyr 


Ala 


Asp 


1031 










cag cag 


eta 


gaa 


gaa 


aaa 


ggc 


Gin Gin 


Leu 


Glu 


Glu 


Lys 


Gly 
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tea cct age act gaa 

Ser Pro Ser Thr Glu 

1441 / 481 

acg ccg etg ctt gtg 

Thr Pro Leu Leu Val 

1501 / 501 

tta gca gat att gca 

Leu Ala Asp lie Ala 

1561 / 521 

gee gga aat ctt ccg 

Ala Gly Asn Leu Pro 

1621 / 541 

ccg cct gat aac gca 

Pro Pro Asp Asn Ala 

1681 / 561 

aaa ggc gtt cgc tac 

Lys Gly Val Arg Tyr 

1741 / 581 

aaa gtg cct get ttt 

Lys Val Pro Ala Phe 

1801 / 601 

cgc ggt gaa gca gat 

Arg Gly Glu Ala Asp 

1861 / 621 

atg tgg agt gac gta 

Met Trp Ser Asp Val 

1921 / 641 

tct act ctt tea ctt 

Ser Thr Leu Ser Leu 

1981 / 661 

ggt gcg ttt tea acg 

Gly Ala Phe Ser Thr 

2041 / 681 

age acg cga cat ctt 

Ser Thr Arg His Leu 

2101 / 701 

tta ggt gtt att cct 

Leu Gly Val He Pro 

2161 / 721 

eta gat gca tea cag 

Leu Asp Ala Ser Gin 

2221 / 741 

etc get aaa aea gta 

Leu Ala Lys Thr Val 

2281 / 761 

acg cgc acg cag ctt 

Thr Arg Thr Gin Leu 

2341 / 781 

ctt gaa gee ttg ctt 

Leu Glu Ala Leu Leu 

2401 / 801 

atg ctt gaa ctg ctt 

Met Leu Glu Leu Leu 

2461 / 821 

ctt ctg cca age ata 

Leu Leu Pro Ser He 

2521 / 841 

aaa caa gca age ate 

Lys Gin Ala Ser He 

2581 / 861 

tat aaa gga att gcg 

Tyr Lys Gly He Ala 

2641 / 881 

ttt att tec aea ccg 

Phe He Ser Thr Pro 

2701 / 901 

atg gte gga ccg gga 

Met Val Gly Pro Gly 

2761 / 921 

eta aaa gaa caa gga 

Leu Lys Glu Gin Gly 

2821 / 941 



cag 
Gin 


tct 
Ser 


gee 
Ala 


aaa 
Lys 


aaa 
Lys 


eta 
Leu 


tac 
Tyr 


ggt 
Gly 


uCa 

Ser 


aat 
Asn 


atg 
Met 


age 
Ser 


aaa 
Lys 


gga 
Gly 


ttt 
Phe 


cgc 
Arg 


gaa 
Glu 


gga 

Gly 


get 
Ala 


gta 
Val 


aag 
Lys 


caa 
Gin 


ttt 
Phe 


gte 
Val 


gac 
Asp 


tec 
Ser 


gta 
Val 


ttt 
Phe 


gga 
Gly 


tgc 
Cys 


ate gat 
He Asp 


gaa 
Glu 


acg 
Thr 


ctt 
Leu 


gca 

Ala 


age 
Ser 


gac 

Asp 


gac 
Asp 


ttt 
Phe 


gca 
Ala 


gee 
Ala 


tac 
Tyr 


ttt 
Phe 


aac 
Asn 


caa 
Gin 


ttt 
Phe 


gte 
Val 


gac 
Asp 


age 
Ser 


aac 
Asn 


gte 
Val 


gta 
Val 


gca 
Ala 


age 
Ser 


gaa 
Glu 


att 
He 


gaa 
Glu 


ctt 
Leu 


cca 
Pro 


cgc aac 
Arg Asn 


tat 
Tyr 


gaa 
Glu 


gga 
Gly 


caa 
Gin 


ate 
He 


egt 
Arg 


ctg 
Leu 


gaa 
Glu 


tec 
Ser 


gta 
Val 


gaa 
Glu 


gag 
Glu 


ctt 
Leu 


cgc gca 
Arg Ala 


a tg 
Met 


get 
Ala 


get 
Ala 


gaa 
Glu 


aag 
Lys 


caa 
Gin 


gee 
Ala 


tac 
Tyr 


gaa 
Glu 


aaa 
Lys 


tac 
Tyr 


ccg 
Pro 


gcg 

Ala 


cgc 
Arg 


ccg 
Pro 


cgc 
Arg 


tat 
Tyr 


tac 
Tyr 


acg gte 
Thr Val 


age 
Ser 


gtt 
Val 


gte 
Val 


teg 
Ser 


aac 
Asn 


tat 
Tyr 


ctt 
Leu 


gee 
Ala 


cag 
Gin 


tea 
Ser 


gaa 
Glu 


ttt 
Phe 


acg 
Thr 


aca 
Thr 


ggc 
Gly 


gte 
Val 


gcg 
Ala 


ccg 
Pro 


cag 
Gin 


tea 
Ser 


ctt 
Leu 


gga 

Gly 


gaa 
Glu 



gca cgc aaa aag gca 
Val Arg Lys Lys Ala 
1471 / 491 
atg gga aca get gaa 
Met Gly Thr Ala Glu 
1531 / 511 
gca ccg cag gte gca 
Ala Pro Gin Val Ala 
1591 / 531 
tta att gta acg gcg 
Leu lie Val Thr Ala 
1651 / 551 
tgg tta gac caa gcg 
Trp Leu Asp Gin Ala 
1711 / 571 
ggc gat aaa aac tgg 
Gly Asp Lys Asn Trp 
1771 / 591 
gee get aaa ggg gca 
Ala Ala Lys Gly Ala 
1831 / 611 
gaa ggc aca tat gaa 
Glu Gly Thr Tyr Glu 
1891 / 631 
etc gac att gaa aac 
Leu Asp He Glu Asn 
1951 / 651 
gee gcg gat atg ccg 
Ala Ala Asp Met Pro 
2011 / 671 
aaa gaa ctt caa cag 
Lys Glu Leu Gin Gin 
2071 / 691 
aaa gaa get tct tat 
Lys Glu Ala Ser Tyr 
2131 / 711 
ata gta aac cgt gta 
He Val Asn Arg Val 
2191 / 731 
gca gaa gaa gaa aaa 
Ala Glu Glu Glu Lys 
2251 / 751 
ctg caa tac gtg gag 
Leu Gin Tyr Val Glu 
2311 / 771 
aaa acg gte tgc ccg 
Lys Thr Val Cys Pro 
2371 / 791 
aaa gaa caa gtg ctg 
Lys Glu Gin Val Leu 
2431 / 811 
tgt gaa atg aaa ttc 
Cys Glu Met Lys Phe 
2491 / 831 
teg att tct tea tea 
Ser He Ser Ser Ser 
2551 / 851 
tea gga gaa gcg tgg 
Ser Gly Glu Ala Trp 
2611 / 871 
gag ctg caa gaa gga 
Glu Leu Gin Glu Gly 
2671 / 891 
ctg cca aaa gac cct 
Leu Pro Lys Asp Pro 
2731 / 911 
ttt aga ggc ttt gtg 
Phe Arg Gly Phe Val 
2791 / 931 
gca cat tta tac ttc 
Ala His Leu Tyr Phe 
2851 / 951 



gaa 
Glu 


aac 
Asn 


get 
Ala 


cat 
His 


aat 
Asn 


gg a 

Gly 


acg 
Thr 


gcg 
Ala 


cgt 
Arg 


gat 

Asp 


acg 
Thr 


ctt 
Leu 


gat 
Asp 


tea 
Ser 


cae 
His 


tct 
Ser 


tat 
Tyr 


aac 
Asn 


ggt 
Gly 


cat 
His 


tct 
Ser 


get 
Ala 


gat 
Asp 


gaa 
Glu 


gta 
Val 


get 
Ala 


act 
Thr 


acg 
Thr 


tat 
Tyr 


caa 
Gin 


gaa 
Glu 


aac 
Asn 


ate 
He 


get 
Ala 


gac 
Asp 


gaa 

Glu 


tgg 

Trp 


cgt 
Arg 


gaa 

Glu 


cat 
His 


agt 
Ser 


gaa 
Glu 


gat 
Asp 


aat 
Asn 


aaa 
Lys 


ctt 
Leu 


gcg 

Ala 


aaa 

Lys 


atg 
Met 


cac 
His 


cca 
Pro 


ggc 
Gly 


agt 
Ser 


gca 
Ala 


cga 
Arg 


caa 
Gin 


gaa 
Glu 


gga 

Gly 


gat 
Asp 


cat 
His 


aca 
Thr 


gca 
Ala 


agg 
Arg 


ttc 
Phe 


ggc 

Gly 


tta 
Leu 


get 
Ala 


cat 
His 


ttg 
Leu 


cca 
Pro 


ctt 
Leu 


caa 
Gin 


gat 
Asp 


cct 
Pro 


gtt 
Val 


ccg 
Pro 


cat 
His 


aaa 
Lys 


gta 
Val 


gag 
Glu 


gca 
Ala 


aaa 
Lys 


cgt 
Arg 


tta 
Leu 


aca 
Thr 


age 
Ser 


gaa 
Glu 


ttt 
Phe 


ate 
He 


gee 
Ala 


cct 
Pro 


cgt 
Arg 


gte 
Val 


gat 
Asp 


gaa 
Glu 


age 
Ser 


gga 

Gly 


tat 
Tyr 


gga 

Gly 


gaa 
Glu 


gat 
Asp 


acg 
Thr 


att 
He 


acg 
Thr 


tgc 
Cys 


gaa 
Glu 


acg 
Thr 


ccg 
Pro 


ctt 
Leu 


ate 
He 


cag 
Gin 


gcg 

Ala 


cgc 
Arg 


aaa 
Lys 


cag 
Gin 


ggc 

Gly 


tgc 
Cys 


cgt 
Arg 


tea 
Ser 


cct 
Pro 



SUBSTITUTE SHEET (RULE 26) 



RAW SEQUENCE LISTING DATE : 11/27/2001 

PATENT APPLICATION : US/09/856, 339A TIME: 10:15:47 



Input Set : A:\Sequence Listing.txt 
Output Set: N:\CRF3\11212001\I856339A.raw 

3 <110> APPLICANT: Wong, Luet 

4 . Bell/ Stephen 

5 Carmichael, Angus 

7 <120> TITLE OF INVENTION: PROCESS FOR OXIDISING TERPENES 
9 <130> FILE REFERENCE: HO-P02196US0 (10104571) 

11 <140> CURRENT APPLICATION NUMBER: US 09/856, 339A 

12 <141> CURRENT FILING DATE: 2001-05-18 
14 <150> PRIOR APPLICATION NUMBER: PCT/GB99/03873 

^15 <151> PRIOR FILING DATE: 1999-11-19 

17 <150> PRIOR APPLICATION NUMBER: GB 9825421.2 

18 <151> PRIOR FILING DATE: 1998-11-19 
ftj 20 <160> NUMBER OF SEQ ID NOS : 23 
111 22 <170> SOFTWARE: Patentln version 3.0 
P 24 <210> SEQ ID NO: 1 
yj 25 <211> LENGTH: 1242 
|7j 26 <212> TYPE: DNA 

27 <213> ORGANISM: Pseudomonas putida 

~ 29 <220> FEATURE: 

|L 30 <221> NAME/KEY: misc_f eature 
31 <222> LOCATION: (1)..(1242) 

UJ 32 <223> OTHER INFORMATION: Coding sequence 

M 3 5 <40 0> SEQUENCE: 1 

U] 36 acgactgaaa ccatacaaag caacgccaat cttgcccctc tgccacccca tgtgccagag 60 

Q38 cacctggtat tcgacttcga catgtacaat ccgtcgaatc tgtctgccgg cgtgcaggag 120 

^40 gcctgggcag ttctgcaaga atcaaacgta ccggatctgg tgtggactcg ctgcaacggc 180 
42 ggacactgga tcgccactcg cggccaactg atccgtgfagg cctatgaaga ttaccgccac 240 
44 ttttccagcg agtgcccgtt catccctcgt gaagccggcg aagcctacga cttcattccc 300 
46 acctcgatgg atccgcccga gcagcgccag tttcgtgcgc tggccaacca agtggttggc 360 
48 atgccggtgg tggataagct ggagaaccgg atccaggagc .tggcctgctc gctgatcgag 420 
50 agcctgcgcc cgcaaggaca gtgcaacttc accgaggact acgccgaacc cttcccgata 480 
52 cgcatcttca tgctgctcgc aggtctaccg gaagaagata tcccgcactt gaaataccta 540 
54 acggatcaga tgacccgtcc ggatggcagc atgaccttcg cagaggccaa ggaggcgctc 600 
56 tacgactatc tgataccgat catcgagcaa cgcaggcaga agccgggaac cgacgctatc 660 
58 agcatcgttg ccaacggcca ggtcaatggg cgaccgatca ccagtgacga agccaagagg 720 
60 atgtgtggcc tgttactggt cggcggcctg gatacggtgg tcaatttcct cagcttcagc 780 
62 atggagttcc tggccaaaag cccggagcat cgccaggagc tgatcgagcg tcccgagcgt 840 
64 attccagccg cttgcgagga actactccgg cgcttctcgc tggttgccga tggccgcatc 900 
66 ctcacctccg attacgagtt tcatggcgtg caactgaaga aaggtgacca gatcctgcta 960 
68 ccgcagatgc tgtctggcct ggatgagcgc gaaaacgcct gcccgatgca cgtcgacttc 1020 
70 agtcgccaaa aggtttcaca caccaccttt ggccacggca gccatctgtg ccttggccag 1080 
72 cacctggccc gccgggaaat catcgtcacc ctcaaggaat ggctgaccag gattcctgac . 1140 
74 ttctccattg ccccgggtgc ccagattcag cacaagagcg gcatcgtcag cggcgtgcag 1200 
76 gcactccctc tggtctggga tccggcgact accaaagcgg ta 1242 

79 <210> SEQ ID NO: 2 

80 <211> LENGTH: 3150 

81 <212> TYPE: DNA * * \ 

82 <213> ORGANISM: Bacillus megaterium 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/856 , 339A 



DATE: 11/27/2001 
TIME: 10:15:47 



Input Set : A:\Sequence Listing.txt 
Output Set: N:\CRF3\11212001\I856339A.raw 



84 <220> FEATURE: 

85 <221> NAME/KEY: misc__feature 

86 <222> LOCATION: (1)..(3150) 

87 <223> OTHER INFORMATION: Coding sequence 

90 <400> SEQUENCE: 2 

91 atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 
93 ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 
95 tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 
97 gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 
99 gattttgcag gagacgggtt atttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 
101 cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 
103 gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 
105 gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 

q107 tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 
^109 gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 
j;ill gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 
p;113 attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca tatgctaaac 
JJ115 ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 
U [117 acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 
yill9 ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 
UJ121 gatcctgctc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 
;Jjl23 gaagcgctgc gcttatggcc aactgctcct gcgttttccc tatatgcaaa agaagatacg 
£ 12 5 gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 
^127 cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 
■j;l2 9 gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 
**fl31 tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 
fll33 cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 
|jjl35 aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 
M13 7 tcacctagca ctgaacagtc tgccaaaaaa gcacgcaaaa aggcagaaaa cgctcataat 
Ml 3 9 acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 
141 ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 
143 gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 
145 ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 
147 aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 
149 aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 
151 cgcggtgaag cagatgcaag cgacgacttt gaaggcacat atgaagaatg gcgtgaacat 
15 3 atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 
155 tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 
157 ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 
159 agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 
161 ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 
163 ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 
165 ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 
167 acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 
169 cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 
171 atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 
173 cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 
175 aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 
177 tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/856, 33 9A 



DATE: 11/27/2001 
TIME: 10:15:48 



Input Set : A:\Sequence Listing.txt 
Output Set: N:\CRF3\11212001\I856339A.raw 



179 tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 
.181 atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 
183 ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 
185 catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 
187 cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 
189 gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 
191 ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 
193 gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 
195 cgatacgcaa aagacgtgtg ggctgggtaa 

198 <210> SEQ ID NO: 3 

199 <211> LENGTH: 7 

200 <212> TYPE: PRT 

201 <213> ORGANISM: Artificial Sequence 
203 <220> FEATURE: 

p 204 <221> NAME/KEY: DOMAIN 
yl 205 <222> LOCATION: (1)..(7) 

fr|206 <223> OTHER INFORMATION: Synthetic linker 

fff209 <400> SEQUENCE: 3 

.^211 Thr Asp Gly Thr Ser Ser Thr 

fl 212 1 5 

214 <210> SEQ ID NO: 4 
y 215 <211> LENGTH: 7 
1J216 <212> TYPE: PRT 

k 217 <213> ORGANISM: Artificial Sequence 

Cl219 <220> FEATURE: 

gj22 0 <221> NAME/KEY: DOMAIN 

|11 221 <222> LOCATION: (1) . . (7) 

IP|222 <223> OTHER INFORMATION: Synthetic Linker 

%225 <400> SEQUENCE: 4 

ff227 Thr Asp Gly Ala Ser Ser Ser 

^228 1 5 

230 <210> SEQ ID NO: 5 

231 <211> LENGTH: 17 

232 <212> TYPE: PRT 

233 <213> ORGANISM: Artificial Sequence 
235 <220> FEATURE: 

23 6 <221> NAME/KEY: DOMAIN 

237 <222> LOCATION: (1)..(17) 

238 <223> OTHER INFORMATION: Synthetic Linker 
241 <400> SEQUENCE: 5 

243 Thr Asp Gly Thr Arg Pro Gly Pro Gly Pro Gly Pro Gly Pro Ser Ser 



24 6 Thr 

249 <210> SEQ ID NO: 6 

250 <211> LENGTH: 21 

251 <212> TYPE: PRT 

252 <213> ORGANISM: Artificial Sequence 

254 <220> FEATURE: 

255 <221> NAME/KEY: DOMAIN 



2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3150 



244 1 



5 



10 



15 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/856 , 3 39A 



DATE: 11/27/2001 
TIME: 10:15:48 



Input Set : A:\Sequence Listing.txt 
Output Set: N:\CRF3\11212001\I856339A.raw 



256 <222> LOCATION: (1)..(21) 

257 <223> OTHER INFORMATION: Synthetic Linker 
260 <400> SEQUENCE : 6 

262 Thr Asp Gly Thr Arg Pro Gly Pro Gly Pro Gly Pro Gly Pro Gly Pro 

263 15 10 15 

265 Gly Pro Ser Ser Thr 

266 20 



268 <210> SEQ ID NO: 7 

269 <211> LENGTH: 4 

270 <212> TYPE: PRT 

271 <213> ORGANISM: Artificial Sequence 
27 3 <22 0> FEATURE: 

274 <221> NAME/KEY: DOMAIN 

275 <222> LOCATION: (1)..(4) 

Q 276 <223> OTHER INFORMATION: Synthetic Linker 
•yO 279 <400> SEQUENCE: 7 
pj 281 Pro Leu Glu Leu 
Hi 282 1 

%l 284 <210> SEQ ID NO: 8 
fl 285 <211> LENGTH: 7 
f;t 286 <212> TYPE: PRT 

W 287 <213> ORGANISM: Artificial Sequence 

'A 289 <220> FEATURE: 

s 290 <221> NAME/KEY: DOMAIN 

p 291 <222> LOCATION: (1)..(7) 

jO. 292 <223> OTHER INFORMATION: Synthetic Linker 

fl 295 <400> SEQUENCE: 8 

U 2 97 Thr Asp Gly Gly Ser Ser Ser 

tU298 1 5 

300 <210> SEQ ID NO: 9 
^ 301 <211> LENGTH: 51 

302 <212> TYPE: DNA 

303 <213> ORGANISM: Artificial Sequence 

305 <220> FEATURE: 

306 <221> NAME/KEY: misc_f eature 

307 <222> LOCATION: (1)..(51) 

308 <223> OTHER INFORMATION: Synthetic Primer 

311 <400> SEQUENCE: 9 

312 gagattaaga attcataaac acatgggagt gcgtgccata tgaacgcaaa c 51 

315 <210> SEQ ID NO: 10 

316 <211> LENGTH: 3 6 

317 <212> TYPE: DNA 

318 <213> ORGANISM: Artificial Sequence 

320 <220> FEATURE: 

321 <221> NAME/KEY: variation 

322 <222> LOCATION: (1)..(36) 

323 <223> OTHER INFORMATION: Synthetic desired coding sequence 

326 <400> SEQUENCE: 10 

327 gaactgagta gtgccactga cggaggatcc tcatcg 36 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/856 f 339A 



DATE: 11/27/2001 
TIME: 10:15:48 



Input Set : A:\Sequence Listing.txt 
Output Set: N:\CRF3\11212001\I856339A.raw 



330 <210> SEQ ID NO: 11 

331 <211> LENGTH: 36 

332 <212> TYPE: DNA 

333 <213> ORGANISM: Artificial Sequence 

335 <220> FEATURE: 

336 <221> NAME/KEY: misc_f eature 

337 <222> LOCATION: (1)..(36) 

338 <223> OTHER INFORMATION: Synthetic Primer 

341 <400> SEQUENCE: 11 

342 cgatgaggat cctccgtcag tggcactact cagttc 36 

345 <210> SEQ ID NO: 12 

346 <211> LENGTH: 36 

347 <212> TYPE: DNA 

348 <213> ORGANISM: ARTIFICIAL SEQUENCE 
p. 350 <220> FEATURE: 

jl 351 <221> NAME/KEY: misc„f eature 
§J 352 <222> LOCATION: (1)..(36) 

jjl 353 <223> OTHER INFORMATION: Synthetic Primer 
^1356 <400> SEQUENCE: 12 

f\ 357 tcatcgggat cctcatcgat gtctaaagta gtgtat 36 
1**360 <210> SEQ ID NO: 13 
4f 361 <211> LENGTH: 52 
S 362 <212> TYPE: DNA 

« 3 63 <213> ORGANISM: Artificial Sequence 

p 365 <220> FEATURE: 

fli 366 <221> NAME/KEY: misc_f eature 

367 <222> LOCATION: (1)..(52) 
\n 368 <223> OTHER INFORMATION: Synthetic desired coding sequence 
^371 <400> SEQUENCE: 13 

372 cccgataggc aatggtaatc atcgggagtc tagagcatcg aagctttcat eg 52 

375 <210> SEQ ID NO: 14 

376 <211> LENGTH: 52 

377 <212> TYPE: DNA 

378 <213> ORGANISM: Artificial Sequence 

380 <220> FEATURE: 

381 <221> NAME/KEY: misc_f eature 

382 <222> LOCATION: (1)..(52) 

383 <223> OTHER INFORMATION: Synthetic Primer 

386 <400> SEQUENCE: 14 

387 cgatgaaagc ttcgatgetc tagactcccg atgattacca ttgectateg gg 52 

390 <210> SEQ ID NO: 15 

391 <211> LENGTH: 24 

392 <212> TYPE: DNA 

393 <213> ORGANISM: Artificial Sequence 

395 <220> FEATURE: 

396 <221> NAME/KEY: misc_f eature 

397 <222> LOCATION: (1)..(24) 

398 <223> OTHER INFORMATION: Synthetic Primer 
401 <400> SEQUENCE: 15 
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VERIFICATION SUMMARY DATE : 11/27/2001 

PATENT APPLICATION: US/09/856, 339A TIME: 10:15:49 

Input Set : A:\Sequence Listing.txt 
Output Set: N:\CRF3\11212001\I856339A.raw 



o 

03 

m 
01 
w 
w 

© 

o 
m 

ui 
a 
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SEQUENCE LISTING 

<110> Isis Innovation Limited 
Wong, Luet 
Bell, Stephen 
Carmichael, Angus 

<120> Process for Oxidising Terpenes 

<130> P02196US0 

<140> US 09/856,339 
<141> 2001-05-18 

<150> PCT/GB99/038 73 
<151> 1999-11-19 

<150> GB 9825421.2 
<151> 1998-11-19 

<160> 23 

<170> Patentln version 3.0 

<210> 1 
<211> 414 
<212> PRT 

<213> Pseudomonas putida 
<400> 1 

Thr Thr Glu Thr lie Gin Ser Asn Ala Asn Leu Ala Pro Leu Pro Pro 

15 10 15 

His Val Pro Glu His Leu Val Phe Asp Phe Asp Met Tyr Asn Pro Ser 
20 25 30 

Asn Leu Ser Ala Gly Val Gin Glu Ala Trp Ala Val Leu Gin Glu Ser 
35 40 45 

Asn Val Pro Asp Leu Val Trp Thr Arg Cys Asn Gly Gly His Trp lie 
50 55 60 

Ala Thr Arg Gly Gin Leu lie Arg Glu Ala Tyr Glu Asp Tyr Arg His 
65 70 75 80 

Phe Ser Ser Glu Cys Pro Phe lie Pro Arg Glu Ala Gly Glu Ala Tyr 
85 90 95 

Asp Phe lie Pro Thr Ser Met Asp Pro Pro Glu Gin Arg Gin Phe Arg 
100 105 110 

Ala Leu Ala Asn Gin Val Val Gly Met Pro Val Val Asp Lys Leu Glu 
115 120 125 

Asn Arg lie Gin Glu Leu Ala Cys Ser Leu lie Glu Ser Leu Arg Pro 
130 135 140 

Gin Gly Gin Cys Asn Phe Thr Glu Asp Tyr Ala Glu Pro Phe Pro lie 
145 150 155 160 



25060937 



- 1 - 



Arg lie Phe Met Leu Leu Ala Gly Leu Pro Glu Glu Asp lie Pro His 
165 170 175 

Leu Lys Tyr Leu Thr Asp Gin Met Thr Arg Pro Asp Gly Ser Met Thr 
180 185 190 

Phe Ala Glu Ala Lys Glu Ala Leu Tyr Asp Tyr Leu lie Pro lie lie 
195 200 205 

Glu Gin Arg Arg Gin Lys Pro Gly Thr Asp Ala lie Ser lie Val Ala 
210 215 220 

Asn Gly Gin Val Asn Gly Arg Pro lie Thr Ser Asp Glu Ala Lys Arg 
225 230 235 240 

Met Cys Gly Leu Leu Leu Val Gly Gly Leu Asp Thr Val Val Asn Phe 
245 250 255 

Leu Ser Phe Ser Met Glu Phe Leu Ala Lys Ser Pro Glu His Arg Gin 
2 60 2 65 27 0 

Cj 

tfJ Glu Leu lie Glu Arg Pro Glu Arg lie Pro Ala Ala Cys Glu Glu Leu 
CO 275 280 285 

in 

01 Leu Arg Arg Phe Ser Leu Val Ala Asp Gly Arg lie Leu Thr Ser Asp 
290 295 300 

i ■ \ 

!5 T Y r Glu Phe His Gly Val Gin Leu Lys Lys Gly Asp Gin lie Leu Leu 
m 305 310 315 320 

P Pro Gin Met Leu Ser Gly Leu Asp Glu Arg Glu Asn Ala Cys Pro Met 
CJ 325 330 335 

fjl His Val Asp Phe Ser Arg Gin Lys Val Ser His Thr Thr Phe Gly His 
340 345 350 

Gly Ser His Leu Cys Leu Gly Gin His Leu Ala Arg Arg Glu lie lie 
355 360 365 

Val Thr Leu Lys Glu Trp Leu Thr Arg lie Pro Asp Phe Ser lie Ala 
370 375 380 

Pro Gly Ala Gin lie Gin His Lys Ser Gly lie Val Ser Gly Val Gin 
385 390 395 400 

Ala Leu Pro Leu Val Trp Asp Pro Ala Thr Thr Lys Ala Val 
405 410 

<210> 2 
<211> 1242 
<212> DNA 

<213> Pseudomonas putida 
<400> 2 

acgactgaaa ccatacaaag caacgccaat cttgcccctc tgccacccca tgtgccagag 60 
cacctggtat tcgacttcga catgtacaat ccgtcgaatc tgtctgccgg cgtgcaggag 120 
gcctgggcag ttctgcaaga atcaaacgta ccggatctgg tgtggactcg ctgcaacggc 180 
ggacactgga tcgccactcg cggccaactg atccgtgagg cctatgaaga ttaccgccac 24 0 



25060937 



- 2 - 





ttttccagcg 


agtgcccgtt 


catccctcgt 


gaagccggcg 


aagcctacga 


cttcattccc 




acctcgatgg 


atccgcccga 


gcagcgccag 


tttcgtgcgc 


tggccaacca 


agtggttggc 




atgccggtgg 


tggataagct 


ggagaaccgg 


atccaggagc 


tggcctgctc 


gctgatcgag 




agcctgcgcc 


cgcaaggaca 


gtgcaacttc 


accgaggact 


acgccgaacc 


cttcccgata 




cgcatcttca 


tgctgctcgc 


aggtctaccg 


gaagaagata 


tcccgcactt 


gaaataccta 




acggatcaga 


tgacccgtcc 


ggatggcagc 


atgaccttcg 


cagaggccaa 


ggaggcgctc 




tacgactatc 


tgataccgat 


catcgagcaa 


cgcaggcaga 


agccgggaac 


cgacgctatc 




agcatcgttg 


ccaacggcca 


ggtcaatggg 


cgaccgatca 


ccagtgacga 


agccaagagg 




atgtgtggcc 


tgttactggt 


cggcggcctg 


gatacggtgg 


tcaatttcct 


cagcttcagc 


•is— 


atggagttcc 


tggccaaaag 


cccggagcat 


cgccaggagc 


tgatcgagcg 


tcccgagcgt 




attccagccg 


cttgcgagga 


actactccgg 


cgcttctcgc 


tggttgccga 


tggccgcatc 


.£;:::=; 
U = 


ctcacctccg 


attacgagtt 


tcatggcgtg 


caactgaaga 


aaggtgacca 


gatcctgcta 


UJ 


ccgcagatgc 


tgtctggcct 


ggatgagcgc 


gaaaacgcct 


gcccgatgca 


cgtcgacttc 




agt cgccaaa 


aggt t 1 c aca 


O d. O ei O L L L 


ggccacggca 


gccat ct gtg 


ccttggccag 




cacctggccc 


gccgggaaat 


catcgtcacc 


ctcaaggaat 


ggctgaccag 


gattcctgac 




ttctccattg 


ccccgggtgc 


ccagattcag 


cacaagagcg 


gcatcgtcag 


cggcgtgcag 


ui 


gcactccctc 


tggtctggga 


tccggcgact 


accaaagcgg 


ta 





<210> 3 
<211> 1049 
<212> PRT 

<213> Bacillus megaterium 
<400> 3 

Met Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 
15 10 15 

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
20 25 30 

lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 
35 40 45 

Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp 
50 55 60 

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
65 70 75 80 

Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
85 90 95 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1242 
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Trp Lys Lys Ala 
100 

Met Lys Gly Tyr 
115 

Gin Lys Trp Glu 
130 

Asp Met Thr Arg 
145 

Tyr Arg Phe Asn 



Ser Met Val Arg 
180 

Asn Pro Asp Asp 
195 

Asp lie Lys Val 
210 

Lys Ala Ser Gly 
225 

Gly Lys Asp Pro 



Tyr Gin lie lie 
260 

Leu Leu Ser Phe 
275 

Gin Lys Ala Ala 
290 

Ser Tyr Lys Gin 

305 

Glu Ala Leu Arg 



Lys Glu Asp Thr 
340 

Glu Leu Met Val 
355 

Gly Asp Asp Val 
370 

Ala lie Pro Gin 
385 

Cys lie Gly Gin 



Met Met Leu Lys 



His Asn lie Leu 



His Ala Met Met 
120 

Arg Leu Asn Ala 
135 

Leu Thr Leu Asp 
150 

Ser Phe Tyr Arg 
165 

Ala Leu Asp Glu 



Pro Ala Tyr Asp 
200 

Met Asn Asp Leu 

215 

Glu Gin Ser Asp 
230 

Glu Thr Gly Glu 
245 

Thr Phe Leu lie 



Ala Leu Tyr Phe 
280 

Glu Glu Ala Ala 
295 

Val Lys Gin Leu 
310 

Leu Trp Pro Thr 
325 

Val Leu Gly Gly 



Leu lie Pro Gin 
360 

Glu Glu Phe Arg 
375 

His Ala Phe Lys 
390 

Gin Phe Ala Leu 
405 

His Phe Asp Phe 



Leu Pro Ser Phe 
105 

Val Asp lie Ala 



Asp Glu His lie 
140 

Thr lie Gly Leu 
155 

Asp Gin Pro His 
170 

Ala Met Asn Lys 
185 

Glu Asn Lys Arg 



Val Asp Lys lie 
220 

Asp Leu Leu Thr 
235 

Pro Leu Asp Asp 
250 

Ala Gly His Glu 
265 

Leu Val Lys Asn 



Arg Val Leu Val 
300 

Lys Tyr Val Gly 
315 

Ala Pro Ala Phe 
330 

Glu Tyr Pro Leu 
345 

Leu His Arg Asp 



Pro Glu Arg Phe 
380 

Pro Phe Gly Asn 
395 

His Glu Ala Thr 
410 

Glu Asp His Thr 



Ser Gin Gin Ala 
110 

Val Gin Leu Val 
125 

Glu Val Pro Glu 



Cys Gly Phe Asn 
160 

Pro Phe lie Thr 
175 

Leu Gin Arg Ala 
190 

Gin Phe Gin Glu 
205 

lie Ala Asp Arg 



His Met Leu Asn 
240 

Glu Asn lie Arg 
255 

Thr Thr Ser Gly 
270 

Pro His Val Leu 

285 

Asp Pro Ala Pro 



Met Val Leu Asn 
320 

Ser Leu Tyr Ala 
335 

Glu Lys Gly Asp 
350 

Lys Thr lie Trp 
365 

Glu Asn Pro Ser 



Gly Gin Arg Ala 
400 

Leu Val Leu Gly 
415 

Asn Tyr Glu Leu 
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to 
i pi 



420 425 430 

Asp lie Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 
435 440 445 

Ala Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr 
450 455 460 

Glu Gin Ser Ala Lys Lys Ala Arg Lys Lys Ala Glu Asn Ala His Asn 
465 470 475 480 

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 
485 490 495 

Thr Ala Arg Asp Leu Ala Asp He Ala Met Ser Lys Gly Phe Ala Pro 
500 505 510 

Gin Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly 
515 520 525 

Ala Val Leu He Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn 
530 535 540 

Ala Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val 
545 550 555 560 

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 
565 570 575 

Thr Thr Tyr Gin Lys Val Pro Ala Phe He Asp Glu Thr Leu Ala Ala 
580 585 590 

Lys Gly Ala Glu Asn He Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 
595 600 605 

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
610 615 620 

Val Ala Ala Tyr Phe Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lys 
625 630 635 640 

Ser Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu 
645 650 655 

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 
660 665 670 

Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu He Glu 
675 680 685 

Leu Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He 
690 695 700 

Pro Arg Asn Tyr Glu Gly He Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 715 720 

Leu Asp Ala Ser Gin Gin He Arg Leu Glu Ala Glu Glu Glu Lys Leu 
725 730 735 

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 
740 745 750 
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Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 780 

Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 800 

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 
805 810 815 

Glu Phe lie Ala Leu Leu Pro Ser He Arg Pro Arg Tyr Tyr Ser He 
820 825 830 

Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser He Thr Val Ser 
835 840 845 

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly He 
850 855 860 

Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cys 
865 870 875 880 

Phe He Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
885 890 895 

Thr Pro Leu He Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 
900 905 910 

Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 925 

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 940 

Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly He He Thr 
945 950 955 960 

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
965 970 975 

Gin His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp 
980 985 990 

Gin Gly Ala His Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin 
1010 1015 1020 

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu 
1025 1030 1035 

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 1045 



<210> 4 
<211> 3150 
<212> DNA 
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<213> Bacillus megaterium 
<400> 4 

atgacaatta aagaaatgcc tcagccaaaa 
ttaaacacag ataaaccggt tcaagctttg 
tttaaattcg aggcgcctgg tcgtgaatca 
aaatttgtac gtgtaacgcg ctacttatca 
gattttgcag gagacgggtt atttacaagc 
cataatatct tacttccaag cttcagtcag 
gtcgatatcg ccgtgcagct tgttcaaaag 
gaagtaccgg aagacatgac acgtttaacg 
tatcgcttta acagctttta ccgagatcag 
gcactggatg aagcaatgaa caagctgcag 
gaaaacaagc gccagtttca agaagatatc 
attgcagatc gcaaagcaag cggtgaacaa 
ggaaaagatc cagaaacggg tgagccgctt 
acattcttaa ttgcgggaca cgaaacaaca 
ttagtgaaaa atccacatgt attacaaaaa 
gatcctgctc caagctacaa acaagtcaaa 
gaagcgctgc gcttatggcc aactgctcct 
gtgcttggag gagaatatcc tttagaaaaa 
cttcaccgtg ataaaacaat ttggggagac 
gaaaatccaa gtgcgattcc gcagcatgcg 
tgtatcggtc agcagttcgc tcttcatgaa 
cactttgact ttgaagatca tacaaactac 
aaacctgaag gctttgtggt aaaagcaaaa 
tcacctagca ctgaacagtc tgccaaaaaa 
acgccgctgc ttgtgctata cggttcaaat 
ttagcagata ttgcaatgag caaaggattt 
gccggaaatc ttccgcgcga aggagctgta 
ccgcctgata acgcaaagca atttgtcgac 
aaaggcgttc gctactccgt atttggatgc 




acgtttggag agcttaaaaa tttaccgtta 60 

atgaaaattg cggatgaatt aggagaaatc 120 

cgctttgata aaaacttaag tcaagcgctt 180 

agtcagcgtc taattaaaga agcatgcgat 240 

tggacgcatg aaaaaaattg gaaaaaagcg 300 

caggcaatga aaggctatca tgcgatgatg 360 

tgggagcgtc taaatgcaga tgagcatatt 420 

cttgatacaa ttggtctttg cggctttaac 480 

cctcatccat ttattacaag tatggtccgt 540 

cgagcaaatc cagacgaccc agcttatgat 600 

aaggtgatga acgacctagt agataaaatt 660 

agcgatgatt tattaacgca tatgctaaac 720 

gatgacgaga acattcgcta tcaaattatt 780 

agtggtcttt tatcatttgc gctgtatttc 840 

gcagcagaag aagcagcacg agttctagta 900 

cagcttaaat atgtcggcat ggtcttaaac 960 

gcgttttccc tatatgcaaa agaagatacg 1020 

ggcgacgaac taatggttct gattcctcag 1080 

gatgtggaag agttccgtcc agagcgtttt 1140 

tttaaaccgt ttggaaacgg tcagcgtgcg 1200 

gcaacgctgg tacttggtat gatgctaaaa 1260 

gagctggata ttaaagaaac tttaacgtta 1320 

tcgaaaaaaa ttccgcttgg cggtattcct 1380 

gcacgcaaaa aggcagaaaa cgctcataat 14 40 

atgggaacag ctgaaggaac ggcgcgtgat 1500 

gcaccgcagg tcgcaacgct tgattcacac 1560 

ttaattgtaa cggcgtctta taacggtcat 1620 

tggttagacc aagcgtctgc tgatgaagta 1680 

ggcgataaaa actgggctac tacgtatcaa 1740 
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• 



aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800 

cgcggtgaag cagatgcaag cgacgacttt gaaggcacat atgaagaatg gcgtgaacat 1860 

atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920 

tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980 

ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 204 0 

agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100 

ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160 

ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220 

ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280 

acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340 

cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400 

atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 24 60 

cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520 

aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 258 0 

tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640 

tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700 

atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760 

ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820 

catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880 
cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 
gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 
ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 

gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120 

cgatacgcaa aagacgtgtg ggctgggtaa 3150 

<210> 5 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<400> 5 

gagattaaga attcataaac acatgggagt gcgtgccata tgaacgcaaa c 51 

<210> 6 
<211> 15 
<212> PRT 



2940 
3000 
3060 
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<213> Artificial Sequence 
<400> 6 

Thr His Arg Ala Ser Pro Gly Leu Tyr Gly Leu Tyr Ser Glu 
15 10 

<210> 7 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<400> 7 

gaactgagta gtgccactga cggaggatcc tcatcg 



<210> 8 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<400> 8 

cgatgaggat cctccgtcag tggcactact cagttc 



<210> 9 

<211> 6 

<212> PRT 

<213> Artificial Sequence 

<400> 9 

Gly Leu Tyr Ser Glu Arg 
1 5 

<210> 10 

<211> 36 

<212> DNA 

<213> Artificial Sequence 

<400> 10 

tcatcgggat cctcatcgat gtctaaagta gtgtat 



<210> 11 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<400> 11 

cccgataggc aatggtaatc atcgggagtc tagagcatcg aagctttcat 



<210> 12 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<400> 12 

cgatgaaagc ttcgatgctc tagactcccg atgattacca ttgcctatcg 
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<210> 13 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<400> 13 

tcatcgacgc gtcgcgaact gctg 24 



<210> 14 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<400> 14 

cccgataggc aatggtaagt aggtgaatat ctaatcccca tctatgcgcg agtggagtct 60 
agagttcga 69 



<210> 15 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<400> 15 

tcatcgaagc ttggctgttt t 21 



<210> 16 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<400> 16 

acaatttcac acaggatcta gaccatatgt catcgaagct ttcatcg 47 



<210> 17 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<400> 17 

cgatgaaagc ttcgatgaca tatggtctag atcctgtgtg aaattgt 47 



<210> 18 
<211> 21 
<212> PRT 

<213> Artificial sequence 
<400> 18 

Thr His Arg Ala Ser Pro Gly Leu Tyr Thr His Arg Ser Glu Arg Ser 
15 10 15 

Glu Arg Thr His Arg 
20 
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<210> 19 
■<211> 21 
<212> PRT 

<213> Artificial sequence 
<400> 19 

Thr His Arg Ala Ser Pro Gly Leu Tyr Ala Leu Ala Ser Glu Arg Ser 
15 10 15 

Glu Arg Ser Glu Arg 
20 

<210> 20 
<211> 17 
<212> PRT 

<213> Artificial sequence 
<400> 20 

Thr Asp Gly Thr Arg Pro Gly Pro Gly Pro Gly Pro Gly Pro Ser Ser 
15 10 15 

Thr 



<210> 21 
<211> 21 
<212> PRT 

<213> Artificial sequence 
<400> 21 

Thr Asp Gly Thr Arg Pro Gly Pro Gly Pro Gly Pro Gly Pro Gly Pro 
15 10 15 

Gly Pro Ser Ser Thr 
20 

<210> 22 
<211> 4 
<212> PRT 

<213> Artificial sequence 

<400> 22 

Pro Leu Glu Leu 
1 

<210> 23 
<211> 21 
<212> PRT 

<213> Artificial sequence 
<400> 23 

Thr His Arg Ala Ser Pro Gly Leu Tyr Gly Leu Tyr Ser Glu Arg Ser 
15 10 15 

Glu Arg Ser Glu Arg 
20 
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CLAIMS 

1. Process for oxidising a substrate which is an 
acyclic or cyclic terpene, or a cycloalkene; or a 
substituted derivative thereof, which process comprises 
oxidising said compound with a mutant haem-containing 
enzyme, the mutant comprising the substitution of an 
amino acid in the active site by an amino acid with a 
less polar side-chain. 

2. Process according to claim 1 in which the 
enzyme is a mutant of P450 cam or P4 50 BM _ 3 , or a mutant of a 
naturally occurring homologue of either of these enzymes. 

3. Process according to claim 2 in which the 
enzyme is one in which amino acid 47 and/or 51 of 
P450 BM „ 3 , or amino acid 96 of P450 ca;n/ or the equivalent 
amino acid(s) in a said homologue, have been changed to 
an amino acid with a less polar side-chain. 

4. Process according to any one of claims 1, 2 or 
3 in which there are one or more other amino acid 
substitutions in the active site. 

5. Process according to any one of the preceding 
claims in which the enzyme is (i) P450 cam and comprises 
one or more of the following mutations: F87W, F87I, F87L, 
T185L, T185F, V247A, V247L or F87A-I395F; or (ii) P450 BM _ 3 
and comprises the mutation R47L-Y51F. 

6. An enzyme as defined in claim 4 or 5 excluding 
mutants of P450 cam which only have the mutations F87A- 
Y96G-F193A, F87A-Y96G-F193A-C334A, or T101M-T185F-V247M. 

7. A polynucleotide which comprises a sequence 
which encodes an enzyme as defined in claim 6. 

8. A cell which expresses: 

(i) an enzyme as defined in any one of claims 2 to 6 
which in its naturally occurring form has an electron 
transfer reductase domain, or 

(ii) (a) an enzyme as defined in any one of claims 1 to 

5; 
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(b) an electron transfer reductase; and 

(c) an electron transfer redoxin. 

9. A cell which expresses: 

(a) (i) P450 cam , or a fragment thereof; or 

5 (ii) a naturally occurring homologue of P450 cara 

or a fragment thereof; or 
(iii)a mutant P45G cam , or a mutant 
homologue of thereof 

as defined in any one of claims 2 to 5; or 
10 (iv) a P450 cam which has ar least 70% amino acid 

homology with (i) , (ii) or (iii)and 
optionally has the mutations defined in 
any one of claims 3 to 5; and 

(b) an electron transfer reductase; and 
15 (c) an electron transfer redoxin: 

excluding an E. Coli DH5a cell in which the only 
mutants of P450 cam which are expressed are amongst the 
following: 

H 2 N-P4 50 cam -TDGTSST-putidaredoxin reductase-TDGASSS- 
2 0 putidaredoxin-COOH, 

H 2 N-P450 cara -TDGTRPGPGPGPGPSST-putidaredoxin 
reductase-TDGASSS-putidaredoxin-COOH, 

H 2 N-P4 5 Q cam -TDGTRPGPGPGPGPGPGPSST-put idar edoxin 
reductase-TDGASSS-putidaredoxin-COOH, 
25 H 2 N-putidaredoxin reductase-TDGASSS-putidaredoxin- 

PLEL - P 4 5 0 cam - COOH . 

10. A cell according to claim 8 or 9 in which (a), 
(b) and (c) or (b) and (c) are expressed together in the 
same fusion protein. 

30 11. A cell according to any one of claims 8 to 10 

in which: 

<b) is putidaretoxin reductase or a fragment 
thereof; and/or 

(c) is putidaretoxin or a fragment thereof. 

35 
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12. A cell which expresses: 

(a) (i) P450 BM _ 3 , or a fragment thereof; or 

(ii) a naturally occurring homologue of P450 BM _ 3 
or a fragment thereof; or 
5 (iii)a mutant P450 BM _ 3 , or a mutant 

homologue of thereof 

as defined in any one of claims 2 to 5. 

13. A process according to any one of claims 1 to 5 
in which the compound is oxidised in a cell according to 

10 any one of claims 8 to 12 

14. A process for making a library of mutants of 
P450 cam or P450 BM _ 3/ or mutants of a homologue of either of 
these enzymes comprising contacting a cell according to 
any one of claims 9, and 10 and 11 when dependent on 

15 claim 9, or according to claim 12 or an E.Coli DH5a cell 

as defined in claim 9; 

with a mutagen and/or when then the cell is a 

mutator cell culturing the cell in conditions in which 

mutants are produced. 
20 15. Process for selecting a mutant of P450 cam or 

P450 BM _ 3 , or a homologue thereof, for its ability to 

oxidise a particular substrate, which process comprises 

screening a group of said mutants for their oxidation 

effect on the particular substrate. 
25 16. Process according to claim 15 in which the 

mutant is additionally selected for its ability to 

oxidise the particular compound to a particular oxidation 

product . 

17. A process according to claim 15 or 16 in which 
30 the screening is carried out on the library made in a 

process according to claim 14. 

18. A process for producing a library of oxidation 
products comprising providing a substrate as defined in 
claim 1 to a library made in a process according to claim 

35 14 and allowing oxidation of the substrate. 
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19. An oxidation product obtained by a process 
according to any one of claims 1 to 5 and 13 wherein 
optionally the enzyme is one which has been selected in a 
process according to claim 15, 16 or 17 for use in a 

5 method of treatment of the human or animal body by 
therapy, 

20. A pharmaceutical composition comprising an 
oxidation product obtained by a process according to any 
one of claims 1 to 5 and 13 wherein optionally the enzyme 

10 is one which has been selected in a process according to 
claim 15, 16 or 17 and a pharmaceutically acceptable 
carrier or diluent. 
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